Still much to learn about mice


A project that aims to mutate every gene in the mouse genome to improve our knowledge of mouse 
biology should help to avoid irreproducible results and costly failures in drug development. 


The mouse is the undisputed king of laboratory science. It achieved its
royal status after it was chosen as the first mammal, after the human, to
have its genome sequenced. Understanding the genome made it possible to
develop new molecular technologies to make mutant mice, and scientists have
made them by the thousand. They have also used these mutant mice to
illuminate how genes and the molecular pathways they control operate in
health and disease. This has also cast some welcome, if indirect, light on
human diseases.

Specialized repositories have sprung up around the world to accommodate
these mutant mice and to allow them to be shared. Everybody benefits:
researchers, who can have the latest mouse mutants sent to them; and science
more broadly, as the repositories guarantee the quality of the genetics and
the health of each strain, which is crucial for comparing the results of
different experiments.

That quality must be defended. At a meeting in Munich, Germany, earlier this
month, representatives of repositories from China, the United States, Europe,
Japan, Canada and elsewhere expressed a concern: new technology renders it so
easy to make a knockout mouse that more scientists may start to use it,
without being aware of the general genetics expertise needed. It is a concern
that deserves broader discussion.

At present, the ability to make a high-quality knockout of a gene in a mouse
requires considerable skill in genetics and breeding techniques. But new and
disruptive technologies (gene-editing methods such as CRISPR) have entered
the scene, making mouse engineering considerably less challenging. Will this
mean a series of strains produced with inadequate quality control? If so,
experiments will be harder to reproduce, and medical research could suffer.

Since 2010, those involved with the mouse repositories, together with other
geneticists, have been coordinating the International Mouse Phenotyping
Consortium (IMPC). The consortium aims to make a conditional mouse mutant, in
which the targeted gene can be switched off to order, for every gene in the
mouse genome in a defined genetic background. Each mutant mouse will be
examined in detail to find out exactly what changes occur in the animal’s
physiology, anatomy or behaviour when the gene is removed. It is a colossal
task, with a colossal estimated price tag of US$900 million to be shared by
participating nations.

The first thousand of these phenotyped mutants will be available in a couple
of months. Fifteen thousand will be available by 2021 if all goes to plan.
But that plan assumes that the requisite funding will continue to flow. And
like all those with power, the mouse has enemies, whose views may shake the
confidence of funding agencies, already notoriously averse to large,
long-term investments such as repositories. Mouse mutants are invaluable in
understanding biological processes and what can go wrong in biochemical or
cellular pathways in diseases such as cancer or Alzheimer’s. Too often,
however, scientists consider them models of human disease, as if a
manipulated gene or two could actually recapitulate a disease in a different
species. Therapies that ‘cure’ a mutant mouse but then fail in the clinic
bring the mouse into disrepute, as recently lamented by Steven Perrin, of the
Amyotrophic Lateral Sclerosis Therapy Development Institute in Cambridge,
Massachusetts, who has witnessed the phenomenon too many times in relation to
this disease (Nature 507, 423-425; 2014).


Some scientists complain that the phenotyping approach is unreliable because
when different laboratories knock out the same gene, they may see different
consequences, adding to the current crisis in the reproducibility of
biomedical results. But the discrepancies are usually because mice in
different labs are of different genetic strains; this makes a big difference
to whether the function of a missing gene will be compensated for. Another
cause of discrepancies can be viruses in the mice, which can change the way
that genes are expressed. In fact, 12% of the strains submitted to one of the
main mouse repositories, the Jackson Laboratory in Bar Harbor, Maine, are
contaminated by pathogens.

This is why it is so desirable to have repositories that guard the health and
genetic quality of the deposited mice. It is also why the IMPC is so
important: by detailing the function of each gene in a standard genetic
background, it will provide a necessary source of information for researchers
for many decades, and help in the effort to ensure that biological results
are reproducible.


Not on the label 


A US push to flag foods as genetically 
engineered is hard to swallow. 


particularly those with a fondness for its maple syrup and ice 
cream. On 8 May, Vermont carved out a new position in the 
national food scene when its governor, Peter Shumlin, signed into 
law a bill that requires foods on sale in the state that were made with 
genetically engineered ingredients to be labelled as such. It is the first 
such law in the country. 
The law’s fate is unclear: food-industry groups immediately vowed 
to challenge it in court. Vermont’s attorney-general is readying the 
state’s legal defence — the bill Shumlin signed included provisions to 
fund these courtroom battles. It is tempting to see Vermont's move as
the first success in a larger US movement that aims to limit the spread 
of genetically modified foods. 

The Center for Food Safety, a consumer activist group in Washing- 
ton DC, says that there are 35 similar food-labelling bills in the works 
across 16 states. That is not to say that all will follow in Vermont's 
footsteps. In 2012 and 2013, voters in California and Washington state 
defeated similar ballot measures. Vermont — the only state to boast a 
self-described socialist as a senator — is something of an outlier on the 
US political spectrum. But the interest in laws on labelling is a striking 
trend in a country that is the world’s leading producer of genetically 
engineered crops. 

There is plenty of precedent for such laws: more than 60 countries 
require genetically engineered foods to be labelled. Many of those 
countries grow few, if any, genetically engineered crops. The US labelling
movement poses a number of logistical challenges. Navigating a
patchwork nation in which labelling requirements vary from state to 
state is one obvious problem for the food industry. The sheer pervasive- 
ness of genetically engineered crops in all manner of foods is another. 

In 2013, such crops populated about half of US farmland. That 
included more than 400,000 hectares of sugar beets modified to 
withstand the herbicide glyphosate. By 2010, some 95% of the US 
sugar-beet crop was genetically engineered, and more than half of 
the processed sugar made in the country derives from sugar beets. 
Although neither the genetically engineered DNA nor protein remains 
in the finished product, laws proposed in some states would require 
that foods containing this sugar be labelled as ‘genetically modified’.

And so it would go for most genetically engineered crops, which 
make their way onto the dinner table largely by way of processed foods. 
Herbicide-tolerant corn (maize) appears as the sweetener high-fruc- 
tose corn syrup, and engineered soya beans are used to make the com- 
mon food additive soy lecithin. Corn oil made from engineered corn 
is chemically no different from that made with conventionally bred 
corn. Yet some proposed laws would require a frozen pizza drizzled 
with corn oil made from genetically engineered corn to be labelled as 
‘genetically modified’.


The definition of that term is set to become even fuzzier as new 
technologies widen the array of genetic modifications available to crop 
breeders. Some are experimenting with ‘cisgenics’ — the science of 
modifying a crop by expressing genes plucked from related species. 
Methods that alter gene expression using RNA molecules are also in 
vogue. And advances in genetic engineering have yielded ways to pre- 
cisely edit the genome, inserting genes at specific locations. These 
methods allow just a few letters of the DNA sequence to be changed. 

It is a far cry from the days when genes that conferred insect resistance or
herbicide tolerance were taken from a bacterium and shot near-randomly into
crop genomes. Yet while regulators are deep in discussions about how to
handle the new varieties of genetically engineered foods, popular conceptions
of such foods seem largely unchanged. Vermont's labelling law and many of the
other proposals make no distinction: products of a crop engineered using
recombinant DNA techniques to make heritable changes to the genome are to be
labelled, regardless of whether that change was one that could have been
produced through conventional methods such as breeding with relatives or
exposing seeds to mutagens. It is also not clear whether the labelling laws
could be enforced: determining the provenance of some of these engineered
crops may be impossible, because the products will be indistinguishable from
those made using conventional crops.

The issue of genetically engineered foods is a muddled one, and the debate
surrounding them is heated. Some oppose the technology because they oppose
industrialized agriculture; others worry that engineered crops could pose
environmental hazards. And many consumers believe, despite evidence to the
contrary, that the foods pose more health risks than those grown through
conventional breeding and mutagenesis.

Researchers may understandably be hesitant to plunge into these turbulent
waters. But the popular discourse around genetically engineered crops is in
dire need of a scientific update. Without it, public discussion and political
legislation will continue to drift away from reality.


Out with a bang 


The discovery of a Wolf-Rayet supernova rebuts
the idea that the biggest stars go quietly. 


A long time ago, a faraway star threw up its insides and ended its days in a
colossal explosion. The first light to hold the record of this supernova
reached Earth about this time last year. Just a few hours later,
quick-thinking astronomers were able to point a telescope at the hole in the
sky where the star had been. The resulting images help to resolve a key
question in stellar physics. And they might raise more questions about the
fate of Earth.

Supernovae are among the most stunning events in the night sky; the
explosions are so well known for their violence that the term has even
entered common parlance. Yet supernovae are rare, and so, therefore, are
direct observations of the circumstances immediately before and after them.

As astronomers describe on page 471 of this issue, being able to focus on the
immediate aftermath of a supernova has shed new light on why some stars go
bang with such force. In this case, the emission spectra sent out by the
dying star show that it was a Wolf-Rayet star, one of a class of massive
bodies that shed their mass rapidly in strong stellar winds.

The finding is significant because, although astronomers assumed that
Wolf-Rayet stars would go supernova, there was no direct evidence that they
did. In fact, in the absence of observations of such supernovae, a rival
theory was gaining ground: that they might end their lives not in a bang but
with a whimper. As John Eldridge explains in an accompanying News & Views
article on page 431: “Until this event, there was growing evidence that such
stars were likely to have dim or unobservable deaths.”

400 | NATURE | VOL 509 | 22 MAY 2014

Wolf-Rayet stars are more than 20 times more massive than our Sun 
and are very breezy places: their fierce stellar winds can reach more 
than 1,000 kilometres per second. They are also rare, so if the name 
rings a bell then it could be because you have heard of a particular 
specimen: WR 104, a binary star about 2,450 parsecs (8,000 light years) 
from Earth that shot to fame in 2008 when astronomers warned that 
we could be in the firing line if it exploded. If you are concerned by 
this (and you probably needn't be), then the finding that Wolf-Rayet 
stars do go supernova will do little to ease your anxiety. 

A mere supernova would not threaten us at that distance, but some very
massive stars explode as two powerful beams of lethal radiation known as
γ-ray bursts. Depending on which way WR 104 is pointing (and the jury remains
out on that), one of those bursts could head our way.

There are plenty of ifs and buts there (evidence suggests, for instance, that
WR 104 has the wrong environment for γ-ray bursts) but, technically, the odds
of such an event just shortened, very slightly. All Wolf-Rayet stars will go
bang, the paper proposes, WR 104 included. The question is when: it could be
next week, or thousands of years hence. Or it may already have happened.


A Longitude Prize for the twenty-first century

The UK Government’s new prize for substantial innovation to address pressing
societal problems should be welcomed, says Martin Rees.


It is 300 years since the British government pioneered a new approach to
problem-solving. The Longitude Act of 1714 offered a prize of £20,000 (worth
more than £1 million today) to anyone who could devise an accurate method to
determine a ship's position at sea. Among those on the committee that judged
the merit of the entries was the serving Astronomer Royal.

History is repeating itself. In 2014, there is a pressing need for the United
Kingdom to channel more brainpower into innovation, to jump-start new
technologies and to enthuse young people. There are broad societal problems
that demand fresh thinking. So Britain is reconvening the historic Longitude
Committee, this time with a promised reward of £10 million (US$17 million).
Some things are different: in 2014, the challenge to be addressed by the
Longitude Prize will be decided not by government officials, but by the
public. And some things are the same: as Astronomer Royal, I chair the
resurrected committee.

Starting on 22 May, after detailed examination by the BBC science programme
Horizon, the public will be able to vote on the theme of the prize from a
shortlist of six broad topics, each important to human welfare and each
offering scope for creativity: dementia; access to safe, clean water;
treating paralysis; antibiotic resistance; food and nutrition; and flight.
The outcome of the vote will be announced on 25 June. An expert group will
then define and set the rules for a specific challenge.

On flight, for example, the challenge for scientists and engineers could be
to reduce the environmental impact of air travel. On dementia, it could be to
find a way to help people with dementia to live independently for longer. The
£10-million prize fund will be open for five years as a reward for the best
answer to this single specific problem, although the prize could be
configured (as was the challenge of 1714) so that rewards are offered for
intermediate steps.

The new Longitude Prize was first announced by Prime Minister David Cameron
last year. Some are likely to be sceptical of its effectiveness. I think they
are wrong.

A well-designed prize should unleash investment from many quarters, amounting
to much more than the prize itself, by enhancing the competitive focus on a
challenge important for human welfare. The contest should also be newsworthy
enough to raise the profile and reputation of innovators, and to stimulate
young people's interest and enthusiasm, and that could in itself have
substantial social value. For an individual or small company, the prize money
is a significant incentive; for a big company, the publicity will be more
important. Thus both have a motive for participating.

NATURE.COM: Discuss this article online at go.nature.com/7cakri


The original longitude challenge was the talk of London's coffee- 
houses. In the eighteenth century, Britain was a maritime power 
striving against Spain and the Netherlands to rule the waves. Many 
ships were wrecked because there was no way to keep precise track 
of longitude. 

Astronomers believed that the answer would come from the stars. 
John Harrison, a working-class joiner and clockmaker with little for- 
mal education, came closest to receiving the reward money through 
his extraordinary mechanical talent and determination, culminating 
in his H4, the marine chronometer still on display at the Royal Obser- 
vatory in London. 

In the subsequent 300 years, the ‘challenge prize’ concept has been
widely emulated — for instance, a prize was the stimulus for Charles 
Lindbergh's first transatlantic flight. In the United 
States, the publicly funded Defense Advanced 
Research Projects Agency (DARPA) has spon- 
sored competitions for driverless vehicles. The 
total investment by contestants — private entre- 
preneurs and universities — has been much more 
than the $6.5 million offered in prizes. 

Pre-eminent in such ventures in the United 
States is the X-prize Foundation in Culver City, 
California, which oversees and monitors pri- 
vately sponsored prizes. Its ambition is to revi- 
talize markets that are currently “stuck due to 
existing failures or a commonly held belief that 
a solution is not possible”. The first X-prize, for 
privately developed sub-orbital space flight, was
awarded in 2004. 

This type of prize has advantages over more 
conventional awards. The winner is decided 
objectively — as in athletics, and unlike the 
Oscars and literary prizes. And such prizes recognize and boost up- 
and-coming talent — unlike the Nobel and similar prizes, for which 
recognition may be delayed for decades. 

The original Board of Longitude lasted more than a century, and 
offered rewards for further discoveries and innovations. It was, in a 
way, the precursor of Britain’s current Research Councils, Technology 
Strategy Boards, and so on. No longer is there a manifest number- 
one problem as there was in the eighteenth century. Today's research 
agenda is hugely more diverse and on a much larger scale; both public 
and private sectors provide many incentives and pathways to innovation
that did not then exist. The £10 million offered by the new Longitude
Prize is less than a thousandth of what Britain spends each year
on research and development. But I am confident that it could have
a disproportionate impact: it is surely an experiment worth trying.


Martin Rees is chair of the Longitude 2014 Committee. 
e-mail: longitude.prize@nesta.org.uk
Selections from the
scientific literature 


RESEARCH HIGHLIGHTS 


Star partners form 
strongest magnet 


Having a stellar partner might 
allow a dying star to become 
a magnetar (the strongest
known magnet in the Universe)
instead of a black hole.
Large stars usually form 
black holes when they 
die, so astronomers have 
wondered why some become 
magnetars, an unusual kind 
of neutron star. Simon Clarke 
of the Open University in 
Milton Keynes, UK, and his 
colleagues used the European 
Southern Observatory’s Very 
Large Telescope to study the 
Westerlund 1 star cluster, 
which includes a magnetar. 
They found a star that 
they say was probably the 
companion to the one that 
formed the magnetar. The two 
orbited each other closely, and 
as the larger one began to die, 
it transferred its outer layers to 
the smaller star. This made the 
small star rotate more rapidly, 
eventually creating an ultra- 
strong magnetic field. 
Astron. Astrophys. 565, A90 (2014) 


Oldest sperm
found in fossil


Tiny fossil crustaceans found in Australia contain remarkably preserved giant
sperm that are between 16 million and 23 million years old, the oldest
reproductive cells ever discovered.

Renate Matzke-Karasz at Ludwig Maximilian University in Munich, Germany, and
her colleagues examined ancient freshwater sediments in Queensland and used
synchrotron X-rays to image the internal structure of fossil ostracods,
crustaceans around 1 millimetre in length. The authors found well-preserved
detail of soft tissue, including internal organs and sperm clusters around
1.2 mm long. The team even identified nuclei in some sperm.

Many modern ostracods have huge sperm relative to their body size, and these
fossils show that the trait evolved long ago, although it is not clear why.

Proc. R. Soc. B http://doi.org/ssh (2014)


BIOPHYSICS


Fast swimming with fake shark skin


A material that mimics shark skin enables a swimming robot to move quickly
through the water by improving hydrodynamics.

Li Wen, James Weaver and George Lauder at Harvard University in Cambridge,
Massachusetts, used three-dimensional printing to embed thousands of rigid
tooth-like scales on a flexible membrane (pictured), based on the skin
structure of the shortfin mako shark (Isurus oxyrinchus). The authors
compared the synthetic shark skin to a smooth control model in a robot swim
test and found that the experimental skin moved 6.6% faster.

The skin eases swimming both by reducing drag and by generating vortices that
boost thrust, the authors suggest.

J. Exp. Biol. 217, 1656-1666 (2014)


Simple recipe for
small molecules


A synthesis method that uses just one chemical reaction and 12 building
blocks could allow chemists to automate construction of the backbones of
thousands of small molecules.

This kind of modular simplicity is standard in the laboratory synthesis of
proteins, nucleotides and, increasingly, carbohydrates. Martin Burke and his
colleagues at the University of Illinois at Urbana-Champaign analysed more
than 2,800 natural products, including pharmaceuticals, that contain polyene
motifs (chains of carbon atoms connected by alternating single and double
bonds). The researchers report that more than 75% of polyene structures can
be made by sequentially linking building blocks from a small library of
organic acid molecules that contain boron. These ‘MIDA-boronates’ were
invented by Burke’s group and are commercially available. This approach
avoids the need to invent a customized method for every polyene-containing
compound, the authors say.
Nature Chem. http://doi.org/ssv (2014)


GENOMICS
Spider genomes 
hold venom secrets 


Genome sequences from 
two spider species reveal the 
composition of their silk, and 
how spider venom exacts its 
toxic toll. 

Mikkel Schierup at Aarhus 
University in Denmark and his
team sequenced the genomes of 
an African social velvet spider 
(Stegodyphus mimosarum) 

and a Brazilian white-knee 
tarantula (Acanthoscurria 
geniculata; pictured), and 
identified proteins that 
comprise the arachnids’ venom 
and silk. The venom of both 
species contains numerous 
protein-cleaving enzymes that 
probably activate precursors 

of toxic venom proteins. More 
than 70% of velvet-spider 
venom is made up of enzymes 
that break down fats. 

The team also found that 
the velvet spider makes a more 
diverse array of silk proteins 
than the tarantula. That is 
probably because it uses its silk 
in a greater variety of ways, 
such as creating complex webs 
to capture prey, says the team. 
Nature Commun. 5, 3765 (2014) 


REGENERATIVE BIOLOGY 


Stem cells make 
bone in monkeys 


Allowing a type of stem cell to 
mature before transplanting it 
into monkeys seems to prevent 
the cells from forming tumours 
or triggering inflammation — 
two safety concerns that have 
plagued the field. Instead, the 
cells generated new tissue. 
Induced pluripotent 
stem (iPS) cells can form a
variety of tissues and can be 
made using a patient’s own 
cells. Cynthia Dunbar at the 
National Institutes of Health in 
Bethesda, Maryland, and her 
team made iPS cells from the 
skin and bone marrow cells of 
two monkeys, and transplanted 
the cells back into the donor 
monkeys. These immature 
cells generated tumours and 
stimulated an inflammatory 
response. However, iPS cells 
that had developed into bone 
precursor cells before being 
transplanted did not cause 


tumours or inflammation 
during the nearly year-long 
study, and formed new bone. 
The findings show the 
promise of iPS cells for 
repairing tissue in humans, the 
researchers say. 
Cell Rep. http://doi.org/stw (2014) 


Antarctic area is
doomed to melt 


Several of Antarctica’s glaciers
have already begun an 
unstoppable meltdown, two 
studies suggest. 

Eric Rignot at the University 
of California, Irvine, and his 
colleagues used satellite radar 
to measure the retreat of five 
glaciers in West Antarctica 
and found that there is nothing 
holding the ice sheets back 
from catastrophic collapse, 
leaving them more vulnerable 
than previously thought. These 
glaciers hold enough water 
to raise the global sea level by 
1.2 metres. 

A team led by Ian 
Joughin at the University 
of Washington in Seattle 
modelled the behaviour of one 
of these glaciers, the Thwaites 
Glacier, and found that it is 
permanently destabilized. 

The melting of Thwaites will 
probably raise sea levels by 

2.5 centimetres over the next 
century and by more than a
millimetre per year within two 
to nine centuries, the team says. 
Geophys. Res. Lett. http://doi.org/ 
srf (2014); Science 344, 735-738 
(2014) 


MATERIALS 


Plastics recycled 
with acid 


The plastics often found in 
electronics typically cannot 
be recycled, but researchers 
have come up with a way of 
producing polymers that can 
be broken down into reusable 
monomers using acid. 
Jeannette Garcia and 
James Hedrick at the IBM 
Almaden Research Center 
in San Jose, California, and 
their colleagues created a 
strong and stable material called a thermoset polymer, used in
automotive, aerospace and electronic parts. They linked together
amine-based monomers with paraformaldehyde to form the thermoset.
By exposing the material to acid solutions of less than pH 2, the
researchers ended up with the monomer building blocks that they
could then reuse.
Science 344, 732-735 (2014)


SOCIAL SELECTION


Family history wins gene debate


Discussion on social media about a recent genetics talk
propelled a five-year-old paper into the limelight. Yale
University bioinformatician Mark Gerstein, while at the
Biology of Genomes meeting in Cold Spring Harbor in
early May, live-tweeted a provocative quote from Stanford
University geneticist Mike Snyder's talk: “Your genome is [a]
better record of your family history than your family.” A lively
Twitter debate ensued. Geneticist Leonid Kruglyak of the
University of California, Los Angeles, following the conference
hashtag from afar, tweeted a dissenting view: “Family history
wins: Predicting human height by Victorian and genomic
methods.” He linked to a 2009 paper showing that a simple
average of the parents’ heights could predict a person's height
better than a genetic profile of 54 locations along the genome.
Aulchenko, Y. S. et al. Eur. J. Hum. Genet. 17, 1070-1075 (2009)


Based on data from altmetric.com.
Altmetric is supported by Macmillan
Science and Education, which owns
Nature Publishing Group.


ANTHROPOLOGY 


Siberian origin for 
Native Americans 


The discovery of a roughly 
12,000-year-old human 
skeleton lends weight to the 
theory that Native Americans 
descended from Siberian 
migrants, rather than from 
people in a later migration. 
Ancient American 
skulls look different from 
those of modern Native 
Americans, leading some to 
think that Native American 
ancestors arrived after 
humans first crossed the 
land bridge between Siberia 
and North America some 
26,000 to 18,000 years ago. 


NATURE.COM 
For more on 
popular papers: 
go.nature.com/tkaG9i 


James Chatters of Applied 
Paleoscience in Bothell, 
Washington, and his 
co-workers studied a near-intact
skeleton of a teenage girl
found in an underwater cave 
in Mexico (pictured). Analysis 
of the bones revealed features 
similar to those of other ancient 
Americans. However, DNA 
extracted from a tooth carried 
a genetic signature that occurs 
only in Native Americans. 

The results suggest that 
evolution on American soil 
led to the differences between 
ancient and Native Americans. 
Science 344, 750-754 (2014) 
SEVEN DAYS


RESEARCH
Statin claims 


Statements reporting that 
cholesterol-lowering statin 
drugs have adverse side effects 
are being withdrawn from 

two articles published in the 
British Medical Journal. The 
move was announced by 
Fiona Godlee, the journal's 
editor-in-chief, in its 17 May 
issue. The conclusion that 
around one-fifth (18-20%) of 
patients on statins experienced 
side effects was based on a 
“misreading” of data from a
separate observational study, 
Godlee writes. The journal is 
considering whether to retract 
the articles fully. See go.nature.com/abswnp
for more.


Brain ethics 


Neuroscientists must give 
greater consideration to 
ethical issues when designing 
experiments, says a 12 May 
report from a panel tasked with 
advising US President Barack 
Obama on bioethical issues. 
The report is the first of two 

to come from the Presidential 
Commission for the Study of 
Bioethical Issues in preparation 
for the Brain Research through 
Advancing Innovative 
Neurotechnologies (BRAIN) 
initiative — a White House- 
backed project that aims to 
map every neuron in the 
human brain. The report says 
that scientists should consider 
issues such as uncovering 
signs of disease when scanning 
individual people’s brains and
the implications of performing 
neurosurgery for mental 
disorders. 


Threats to US bees


Last winter, 23.2% of honeybee colonies in the United States perished, the
US Department of Agriculture announced on 15 May. The 2013-14 death rate is
lower than in previous years but is still above the level that farmers
consider sustainable. Threats such as disease, insecticide exposure and
changes in farming practices are thought to be contributing to declining bee
populations in many parts of the world. See go.nature.com/xtxbt5 for more.


Colorado River reaches the sea


On 15 May, the Colorado River flowed all the way to the Gulf of California
for the first time in 20 years. Around 130 million cubic metres of water were
released from a dam on the US-Mexican border on 23 March, as part of a bold
ecological experiment to quench the parched bed of the once-mighty river (see
Nature 507, 286-287; 2014). The water rushed through ecological restoration
sites, where researchers had uprooted invasive vegetation, allowing native
species such as cottonwoods and willows to germinate. The experiment is part
of a 2012 water-sharing agreement between the United States and Mexico.


POLICY


Satellite regulation
The US government has relaxed restrictions on the export of satellites and
their component parts to other countries. The long-awaited move, announced on
13 May, shifts most commercial, scientific and civilian satellites from a
munitions list tightly regulated by the Department of State into the hands of
the Department of Commerce. University space scientists will now find it
easier to collaborate with international partners. But nearly all commercial
crewed spacecraft remain under the control of the old munitions regulations.


University ranking
A European consortium launched a new way to rate universities on 13 May.
Known as U-Multirank, the system is a searchable database of the performance
of universities from around the world. Institutions are assessed on a wide
range of indicators, such as regional research collaboration and support for
students studying abroad. The system allows people to look at the performance
profile of an institution; compare one university with another; and search
for universities that perform well in particular areas. It differs from
existing rankings, which tend to rate universities in a single league table,
and rely mainly on research performance.


China open access 
The Chinese Academy of 
Sciences and the National 
Natural Science Foundation 
of China both announced 
new open-access policies on
16 May. All researchers and
graduate students who are 
supported by the agencies are 
now required to make their 
publicly funded studies free 
to read. Research papers must 
be made accessible in online 
repositories no later than a 
year after their publication
in an academic journal.
See go.nature.com/ojrmfy
for more.


Swift slime 


The single-celled winners
of an unusual competition
to crown the world’s
fastest slime mould were
announced on 19 May. The
Dicty World Race, held in
Boston, Massachusetts, pitted
strains of the slime mould
Dictyostelium discoideum in
a microfabricated racetrack
800 micrometres long. The
champion was submitted by
a team from the University of
Groningen in the Netherlands,
and was engineered to be
particularly sensitive to a
signalling molecule called
cyclic AMP, which was used
as a chemical attractant
in the race. See
go.nature.com/6ezbrd for more.


US sites at risk 


Historic US landmarks
are at risk of damage and
destruction caused by
sea-level rise, flooding and
wildfires, warns the Union
of Concerned Scientists in a
report released on 20 May.
The advocacy group, based in
Cambridge, Massachusetts,
says that protecting these sites
should be part of a national
climate-change strategy.
Among the top 30 at-risk sites
are Jamestown in Virginia, the
Statue of Liberty (pictured)
and Ellis Island in New York,
and coastal NASA facilities.


TREND WATCH


The academic search engine
Google Scholar can find about
88% of all English-language
scholarly documents on the
World Wide Web, according to an
estimate by computer scientists
Lee Giles and Madian Khabsa at
Pennsylvania State University in
University Park (M. Khabsa and
C. L. Giles PLoS ONE 9, e93949;
2014). The duo studied the
coverage of Google Scholar and a
competitor, Microsoft Academic
Search. At least 24% of documents
are freely available, they add. See
go.nature.com/matsio for more.


Space ties cut 


Russia is severing its space- 
science partnership with the 
United States, according to a 
statement given by Russian 
deputy prime minister 
Dmitry Rogozin on 13 May. 
The move is in retaliation 
against US-imposed sanctions 
linked to Russia’s actions
in Ukraine. Rogozin said
that Russia will end its
long-running collaboration
with the United States on the 
International Space Station in 
2020. NASA, which depends 
on Russian carriers to ferry 
astronauts to and from the 
space station, says that it has 
not been officially notified 
about the move. See go.nature. 
com/6shdlx for more. 


HAARP to close 


The US government's
High Frequency Active
Auroral Research Program 
(HAARP) — a leading 
ionospheric research facility 
near Gakona, Alaska — is 
facing closure. The US$250- 
million facility is used mainly 
by the Defense Advanced 
Research Projects Agency 
(DARPA) but on 14 May, the 
agency said that it would be 
ending its experiments there. 
The US Air Force, which 
manages HAARP, says that 
the programme will shut once 
DARPA concludes its work. 
See go.nature.com/wv69qs 
for more. 


FUNDING
Australian budget 


The Australian government
has allocated Aus$20 billion
(US$18.7 billion) to establish
a Medical Research Future
Fund to tackle diseases
including diabetes and cancer,
it announced on 13 May.
But its budget also included
cuts to the nation’s science
agency — the Commonwealth
Scientific and Industrial
Research Organisation — and
the abolition of its renewable-
energy agency. See
go.nature.com/jhkcvk for more.


Deal spurned

The London-based drug
company AstraZeneca has
rejected another buyout offer
from its New York rival Pfizer.
On 18 May, Pfizer said that
it had offered AstraZeneca
around £55 (US$93) per share
— valuing the British company
at roughly £69 billion. Pfizer
said that this was its final offer
and it would not attempt a
hostile takeover. But it urged
AstraZeneca’s shareholders
to encourage their company’s
board to sell.


TREND WATCH chart: Around 114 million English-language scholarly documents (including
papers, books and technical reports) can be found on the web.


25-30 MAY

The World Organization
for Animal Health holds
its general assembly in
Paris. On the agenda are
standards for world trade
in animals and their
products, and animal
diseases.
go.nature.com/dgum4u


25-30 MAY

The quadrennial
assembly of the Global
Environment Facility
will discuss expanding
financing to support
food security and
sustainable cities.
go.nature.com/bodsub


26-28 MAY
Worldwide heads of
research councils will
discuss open access and
how to support young
researchers at the annual
meeting of the Global
Research Council in
Beijing.
go.nature.com/ofcfna
The New Horizons probe (artist’s impression) is set to pass Pluto and its moon Charon in July next year.


probe faces crisis


NASA scientists scramble to find an object in the outer Solar 
System’s Kuiper belt in time for a close-up visit. 


BY ALEXANDRA WITZE 


Nearly 4.3 billion kilometres from Earth,
and most of the way to Pluto, NASA’s
New Horizons spacecraft is in danger
of missing out on half of its mission. Project
managers face a looming deadline to identify
an icy object in the outer Solar System for the
probe to fly by after it passes Pluto.

A visit to a Kuiper belt object, or KBO,
was always meant to be a key part of New
Horizons’ US$700-million journey, which
began in 2006. But there is only a slim chance
that astronomers will find a suitable KBO
with their current strategy of using
ground-based telescopes — and securing time on the
orbiting Hubble Space Telescope is far from
guaranteed.

New Horizons will fly past Pluto in July 
2015. Soon afterwards, it must fire its engines 
and set itself on course to fly past a selected 
KBO. Project scientists must identify a
KBO in the next several months if they are
to determine the necessary trajectory well 
enough for New Horizons to aim accurately 
and meet its target. 

“They're running out of time,” says Mark 
Sykes, director of the Planetary Science Insti- 
tute in Tucson, Arizona, who is not involved 
in the mission. “We're not just talking about 
science being lost — we're talking about getting 
return on our investment.” 

New Horizons scientists have asked for 
160 orbits’ worth of observing time on the 
hugely oversubscribed Hubble. It is a rare 
request for a NASA mission already in opera- 
tion. The committee that allocates Hubble time 
will make a decision by 13 June. 

The two targets of New Horizons — Pluto 
and a KBO — each promise different rewards. 
“Two different scientific communities are get- 
ting excited about it,” says Will Grundy, mis- 
sion co-investigator at the Lowell Observatory 
in Flagstaff, Arizona. The Pluto fly-by will be 
the first close-up glimpse of this geologically 
active world and its moons. The KBO visit, 
2-3 years later (see ‘Far horizons’), would be 
the best look yet at one of these primordial icy 
bodies — time capsules from the early days of 
the Solar System. 

In theory, project scientists should have 
identified a suitable KBO long ago. But they 
postponed their main search until 2011, wait- 
ing for all the possible KBO targets to begin 
converging on a narrow cone of space that 
New Horizons should be able to reach after 
its Pluto encounter. Starting to look for them 
before 2011 would have been impossible, says 
Grundy, because they would have been spread 
over too much of the sky. 

Now that the hunt for KBOs is on, the New 
Horizons researchers have mainly been using 
the 8.2-metre Subaru Telescope in Hawaii and 
the 6.5-metre Magellan Telescopes in Chile. 
They have found about 50 new KBOs; none is 
close enough for New Horizons to reach. 

Part of the problem is that the search area is 
in the Galactic Plane, where most of the Milky 
Way’s stars are. The bright stars tend to out- 
shine the light coming from the faint KBOs, 
and the crowded field of view means that the 
discovery rate depends strongly on observ- 
ing conditions. Bad weather and poor-quality 
observation on many nights has ruined much 
of the search. 

Early on, the team also overestimated the
number of KBOs that it would find. In
the past couple of years, astronomers have
discovered that there are fewer dim KBOs than
was extrapolated on the basis of the number
of bright ones. “There just aren’t as many faint
ones,” says Grundy — and faint, faraway KBOs
are what New Horizons must detect.


FAR HORIZONS

NASA's New Horizons probe has struggled to identify a Kuiper
belt object to study after it passes Pluto in July 2015.

For a mission that has been carefully 
planned for many years, the failure to find 
a KBO target is striking. But Sykes is not 
alarmed. “That should have been in the
calculus, but it really wouldn't have changed
their ground-based campaign,” he says.

The team has another eight nights with 
Subaru in late June and July, and hopes to win 
some more time in August. It has also applied 
for extra observing time with Magellan in 
October. Even if conditions are near perfect, 
the chances of snaring a KBO from the ground 
this year are less than 40%. With the requested 
Hubble time, those chances soar to greater 
than 90%. 

Without Hubble, it might still be possible to 
identify a KBO target from the ground early 
next year. But that would complicate the mis- 
sion, because New Horizons would have to 
wait for longer after visiting Pluto to fire its 
engines. In that case, the spacecraft would need 
to execute a sharper turn than team scientists 
would like to reach the correct trajectory. 

In the worst-case scenario, New Horizons 
would be forced to observe a KBO from far 
away. Several possible long-distance targets 
have already been spotted (S. D. Benecchi et al. 
Icarus http://doi.org/ssn; 2014). Even with its 
puny 21-centimetre telescope, New Horizons 
would get a better glimpse of a distant KBO 
than the 2.4-metre Hubble can manage from 
an Earth orbit. 

Not that the team has yet given up hope of 
a close KBO target. “If we can find one,” says
Grundy, “we will happily take it.”


Submersible loss hits research


But scientists remain positive about the future of deep-sea exploration despite 
disintegration of unique US Nereus craft. 


BY DANIEL CRESSEY 


Researchers have lost access to a huge
area of the ocean after one
of the world’s most advanced submersibles
blew apart 10 kilometres underwater during
a deep-sea expedition. The loss of Nereus,
operated by the US Woods Hole Oceanographic
Institution (WHOI), leaves ongoing
and future projects high and dry. But scientists 
who study the bottom of the ocean have told 
Nature that their research programmes will 
continue, and that the design of a potential 
successor to Nereus is already under way. 
Nereus was in the Kermadec Trench in the 
Pacific Ocean on 9 May when its five camera 
feeds suddenly went blank. Researchers aboard 
the RV Thomas G. Thompson were unable to 
contact the US$8-million craft. They learned its 
fate when pieces of its hull floated to the surface. 
“It was a horrible feeling — of denial and
disbelief,” says Tim Shank, a biologist at the
WHOI in Massachusetts and chief scientist
on the expedition.

It is likely that one of the ceramic vessels 
used to encase components on Nereus col- 
lapsed under the water pressure, leading to an 
implosive chain reaction. 

Shank’s expedition was part of a US National 
Science Foundation (NSF) programme 
to investigate the hadal zone. This area of 
the ocean, in deep-sea trenches between 
6,000 and 11,000 metres down, is one of the 
least explored regions on Earth. 

Researchers have long dismissed the idea
that the area is a ‘dead zone’. For example, on its
first deep dive in May 2009, Nereus discovered 
a new species of anemone in the deepest part 
of the ocean, Challenger Deep in the Pacific's 
Mariana Trench. But systematic exploration 
has been lacking, leading in 2011 to the crea- 
tion of the NSF project, an international col- 
laboration called the Hadal Ecosystem Studies 
(HADES) programme. It aims to determine
the composition and distribution of hadal
species, and the role of hadal pressures, food 
supply and trench topography on community 
structure. 

During the final dives of Shank’s expedi- 
tion, the first for HADES, researchers were 
measuring the metabolic rates of invertebrates 
such as sea cucumbers. This was done by using 
Nereus to place the creatures in a respirometer 
mounted on a separate platform. The work, 
which began in April, would have provided 
crucial data on whether trenches act as carbon 
sinks, and would have enabled researchers to 
start unpicking the food webs of the deep ocean. 

“The loss of Nereus is an absolute tragedy for 
hadal research. We no longer have the capabil- 
ity to explore the lower two-thirds of our ocean 
with a remotely operated vehicle,” Shank told 
Nature. 

Access to the deep is an ongoing challenge 
because only a few specialized vessels can 
reach this part of the ocean. The WHOI’s Alvin
vehicle, which is crewed rather than operated
remotely, can dive to only about 4,000 metres.
China, Japan, France and Russia have crewed
submersibles that can reach depths of
6,000-7,000 metres (see ‘Into the abyss’). Several
remotely operated or autonomous research
robots are also in use, but few can withstand the
enormous pressures of the hadal zone.

The Nereus submersible is thought to have been lost after extreme water pressure led to an implosion.

The only other operational vessels to rival 
Nereus for depth range are the DEEPSEA 
CHALLENGER craft used by film direc- 
tor James Cameron for his 2012 voyage, and 
Japan’s ABISMO vehicle, which has reached 
10,250 metres. But DEEPSEA CHALLENGER 
is not a scientific craft, and ABISMO’s equipment
is not as advanced as Nereus’s was.

Deep-sea data collection is still possible 
without Nereus. For example, Shank and 
his team have been able to continue some of 
their work by deploying other equipment, 
such as water and sediment samplers. Other 
affected expeditions are also making alterna- 
tive arrangements. Nereus was due to take part 
in another HADES mission later this year, to 
explore the Mariana Trench, and expedition 
chief scientist Jeffrey Drazen of the Univer- 
sity of Hawaii at Manoa says that the mission 
will still go ahead. Its aim is to collect animals, 
images and sediments from the hadal zone, 
and some of the work can be conducted using 
baited traps and instrument platforms, called
landers, that sink to the bottom, says Drazen.
His team is also investigating deploying a 
towed camera system. 

However, these are static technologies. 
Nereus gave real-time access and allowed 
scientists to point cameras directly at objects of 
interest and to collect samples on the fly, rather 
than throwing equipment off the side of their 
ship and hoping for the best. 

There is a glimmer of hope for deep-sea 
exploration with submersibles. Even before 
the demise of Nereus, the WHOI was in dis- 
cussions with the Schmidt Ocean Institute 
in Palo Alto, California, about constructing 
another ‘full-ocean capable’ vehicle. The col- 
laboration is in a preliminary design phase. 
Scientists could be using the vehicle by mid- 
2016, says Victor Zykov, the Schmidt's director 
of research. 

The WHOI is in contact with its insurers 
over Nereus. A payout could be used to build 
two of the next-generation vehicles envisaged 
by Schmidt, instead of one. 

As the Nereus disaster shows, having a 
second submersible is a good idea, says WHOI 
engineer Andy Bowen, an expert in remotely 
operated vehicles who was lead scientist on 
the design of Nereus. “This is a very forceful 
reminder of how difficult it is to work in the 
ocean in general, and at the extremes even
more so,” he says.


INTO THE ABYSS


The world’s fleet of deep-sea submersibles 
has dwindled further with the loss of Nereus.


The rise of technologies such as satellite tracking could have been foretold by analysing past literature.


TECHNOLOGY 


Text-mining offers 
clues to success 


US intelligence programme analyses language in patents 
and papers to identify next big technologies. 


BY SARA REARDON 


A project backed by a US intelligence agency
might soon make it much
easier to predict which technologies
will one day become game-changers.
Results revealed this week by the Intelligence 
Advanced Research Projects Activity (IARPA) 
suggest that clues in the wordings of, and 
relationships between, scientific papers and 
patents could foretell research successes. 

The project, called Foresight and Under- 
standing from Scientific Exposition (FUSE), 
could enable funders to pick winners, and 
help governments to keep an eye on ‘disrup- 
tive technologies’ — those that they feel might 
threaten national security or outpace regula- 
tions, for instance. Past examples include 
nanotechnologies and information technolo- 
gies, such as the use of the Global Positioning 
System in mobile phones to allow tracking of 
individuals’ movements. Last week, FUSE, a 
four-year project that started in 2011, entered 
its last phase: to predict the successes three to 
five years from now. 

Although abstracts have been text-mined 
in the past for keywords and other clues in 
language, FUSE is one of the first projects to 
mine the whole bodies of papers and patents. 
So far, it has performed more than 2 million 
analyses of past data to pick out key advances, 
says Dewey Murdick, manager of the FUSE 
project. From these, it has identified several
hundred indicators, such as new collaborations
or expressions of excitement in text, that high- 
light emerging areas. 

“What we’d like to glean is understanding
of the right combination of things that leads 
to success,” says John Byrnes, a computer sci- 
entist at SRI International, an innovation cen- 
tre in Menlo Park, California, whose team is 
one of three developing software for FUSE. To 
make predictions, his program mines text for 
keywords, citations and phrases that indicate 
authors’ outlooks in scholarly papers. 

One example he cites is the resolution
of a technical problem that, once solved,
led to what is now a mainstay of solar-panel
technology. In the mid-1990s, millions
of dollars were invested in research into solar 
panels that used aqueous solutions to convert 
photons into energy. Although promising at 
first, by 2008, the technology had been over- 
taken by the much more stable and effective 
solid-state solar panels. FUSE might have pre- 
dicted the aqueous panels’ demise, according 
to results presented by Byrnes and his team 
this week at the US Department of Energy’s 
SunShot Grand Challenge Summit in Ana- 
heim, California. 

Scientific literature analysis, or ‘scientometrics’,
is decades old. Organizations such
as Thomson Reuters, an information firm
headquartered in New York, have long used
these analyses to identify the most influential 
papers or researchers in a field. FUSE takes 
this further by mining millions of papers and 
patents in both English and Chinese, two of the 
most commonly used languages in scientific 
literature, says Murdick. 

The analysis and indicators can predict 
whether a nascent field will become prominent 
or whether it is simply a source of excitement 
that will soon die out, says Olga Babko-Malaya,
a research engineer at BAE Systems in Win- 
chester, Massachusetts, who heads another 
FUSE team. 

Her team uses software algorithms to 
analyse ‘sentiment’ in the natural language 
of papers. For instance, authors might say 
that their work builds on or contradicts a 
cited paper, or use descriptive language that 
expresses excitement. 

The researchers also found that promising 
topics invent their own jargon and start using 
more acronyms. “Abbreviations imply accept- 
ance by the community and are indicators of 
more mature technologies,” says Babko-Malaya. 
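
The acronym signal described above can be illustrated with a minimal sketch. This is not the FUSE teams' software: the regular expression, the function names and the two sample sentences are all hypothetical, and a real system would need a far more careful definition of an abbreviation.

```python
import re
from collections import Counter

# Assumed heuristic (not from FUSE): an "acronym" is a standalone
# run of 2 to 6 capital letters, such as "DSSC" or "OPV".
ACRONYM = re.compile(r"\b[A-Z]{2,6}\b")


def acronym_rate(text: str) -> float:
    """Return acronyms per word for one document (0.0 for empty text)."""
    words = text.split()
    if not words:
        return 0.0
    return len(ACRONYM.findall(text)) / len(words)


def rate_by_year(docs):
    """Average the acronym rate over all (year, text) pairs in each year."""
    totals, counts = Counter(), Counter()
    for year, text in docs:
        totals[year] += acronym_rate(text)
        counts[year] += 1
    return {year: totals[year] / counts[year] for year in sorted(counts)}


# Made-up sample sentences, for illustration only.
docs = [
    (2005, "Dye sensitized solar cells convert photons into energy"),
    (2010, "DSSC efficiency rivals that of CIGS and OPV devices"),
]
rates = rate_by_year(docs)
```

A rising rate from one year to the next would be one crude hint that a field's vocabulary is settling, in the sense that Babko-Malaya describes.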

The changes in group collaborations in a 
field over time can also be predictive. FUSE 
researcher Lance Ramshaw at Raytheon BBN 
Technologies in Cambridge, Massachusetts, 
and his team are analysing networks between 
different topics, keywords and authors. He says 
that a new topic may be emerging when promi- 
nent authors start contributing to a group of 
papers that share common traits, or when alli- 
ances between collaborations shift. 

Alan Porter, who specializes in technology 
forecasting at the Georgia Institute of Tech- 
nology in Atlanta, agrees that retrospective 
predictions, such as the solar-panel exam- 
ple, are useful for modelling what companies 
have been doing and for tracking the history 
of a product. The more difficult task, he says, 
would be to use such a network to identify the 
“white spaces”: areas between technology clus- 
ters that are ripe for new research. 

Ideally, analysis will show patterns or tip- 
ping points that are common to success stories. 
Such patterns might eventually allow the pro- 
ject to forecast when a product might launch, or 
whether a drug will be approved by regulators, 
explains Babko-Malaya. 

Although software is catching up, human 
analysts remain the best forecasters, says 
Murdick. “You can ask experts anything you 
want,” he says. 

In another project, Forecasting Science 
& Technology, IARPA is funding an online
crowdsourcing project called SciCast, run 
by George Mason University in Fairfax, Vir- 
ginia, and the American Association for the 
Advancement of Science. It aims to consult 
10,000 scientists to help develop methods for 
generating accurate forecasts. “My personal 
bias is that it’s a combination of people and 
machines that will ultimately provide the most 
useful value,” says Murdick.


Jelly genome mystery


Publication of the draft genetic sequence of a comb jelly
reveals a nervous system like no other.


BY EWEN CALLAWAY 


Comb jellies, or ctenophores, look like
tiny disco balls and propel themselves
around oceans using specialized hairs,
lapping up small prey with their sticky tentacles.
“They are aliens who've come to Earth,”
says Leonid Moroz, a neuroscientist at the 
University of Florida in St Augustine. 

The genome of the Pacific sea gooseberry 
(Pleurobrachia bachei), which Moroz and his 
team report online today in Nature, adds to 
the mystery of ctenophores (L. L. Moroz et al. 
Nature http://dx.doi.org/10.1038/nature13400; 
2014). The sequence omits whole classes 
of genes found in all other animals, includ- 
ing genes normally involved in immunity, 
development and neural function. For that rea- 
son, the researchers contend that ctenophores 
evolved a nervous system independently. 

Ctenophores have long vexed taxonomists. 
Their resemblance to jellyfish earned them 
a spot on the tree of life as a sister group to 
cnidarians (the phylum that includes jelly- 
fish). On the basis of their nervous systems — 
which can detect light, sense prey and move 
musculature — many researchers had them 
branching off from the common ancestor of 
other animals after the sponges and flattened 
multicellular blobs known as placozoans, 
neither of which have a nervous system. Now 
armed with data showing that ctenophores 
lack many common genes, some scientists 
contend that these are the closest living rela- 
tives to the first animals. 

Moroz’s team argues that the P. bachei 
genome, along with gene-expression data 
from other ctenophores, supports this 
theory. For example, microRNAs, which reg- 
ulate gene expression in other animals, are 
completely missing from the sea gooseberry 
genome. 

The biggest surprise, Moroz says, was the 
absence of many standard components of a 
nervous system. Nearly all known nervous 
systems use the same ten primary neurotrans- 
mitters; the Pacific sea gooseberry seems to 
employ just one or two. Moroz speculates that 
the organism might complete its nervous sys- 
tem using molecules that researchers have not 
yet found in this species, such as specialized 
protein hormones. 

The uniqueness of this ctenophore’s nervous
system leads Moroz and his team to argue that
it must have evolved independently, after the
ctenophore lineage branched off from other
animals some 500 million years ago. “Everyone
thinks this kind of complexity cannot be done
twice,” Moroz says. “But this organism suggests
that it happens.”

Pleurobrachia bachei lacks many common genes.

Gert Wörheide, an evolutionary geobiologist
at Ludwig Maximilian University in
Munich, Germany, is intrigued by the theory 
that the nervous system evolved twice in 
different animal branches, but disputes that 
ctenophores are the closest relatives of the first 
animals. 

The common ancestor of all animals may 
have looked nothing like comb jellies, and 
the P. bachei nervous system may be a more 
recent adaptation, he says. “I think the last 
word is not spoken yet on where the ctenophores
go.”


CUTTING COSTS 


‘Tiered pricing’ has reduced the cost of drugs to treat HIV in low- and middle-income countries (LMICs), 
but bigger savings can be generated by encouraging the production of cheap generic medicines.


Activists sound alarm
on tiered drug prices 


Plan could increase health costs in middle-income nations. 


BY ERIKA CHECK HAYDEN 


Global-health campaigners
are protesting against a
proposal that they say would allow
drug companies to raise prices for needed
medicines in many developing countries.

On 13 May, a coalition of 220 non-govern- 
mental organizations (NGOs) condemned a 
proposal from the Global Fund to Fight AIDS, 
Tuberculosis and Malaria that promotes ‘tiered
pricing’, in which drug companies offer the
same medicines at vastly different prices in dif- 
ferent countries. This approach was first used 
in the early 2000s to cut the cost of medications 
for HIV/AIDS, malaria and tuberculosis in 
some of the world’s poorest countries. Econo- 
mists say that it ultimately leads to higher costs 
than does allowing the manufacture of cheaper 
generic medicines (see ‘Cutting costs’). 

The current debate centres on middle- 
income countries such as India, China, South 
Africa and Brazil, which have expanding 
economies but are home to the vast majority 
of the world’s ill people — many of whom are 
poor. As these nations work to increase access 
to health care, they face rising prices for drugs 
and vaccines. But their growing overall wealth 
puts them at risk of losing access to prices and 
assistance reserved for low-income countries. 

“These countries are being hit with a double
whammy,” says Rohit Malpani, director of
policy and analysis for the Access Campaign
of Médecins Sans Frontières (also known
as Doctors Without Borders) in Geneva,
Switzerland.

The Global Fund, which is also based in 
Geneva and is the world’s largest supporter 
of international health programmes, is trying 
to organize an international group to address 
the problem for middle-income nations; early 
plans for its Equitable Access Initiative (EAI) 
mention a global system of tiered pricing as 
a potential solution. According to the fund’s 
documents, organizations that have expressed 
interest in the EAI include the GAVI
Alliance, a vaccine initiative based in
Geneva; the World Bank; the United
Nations Development Programme; UNICEF,
the UN children’s fund; and UNITAID, a group
run by the World Health Organization that
negotiates medicine prices for poor countries.

Seth Faison, the Global Fund’s head of 
communications, says that the organization 
aims to make drugs more affordable in needy 
countries. “We all have the same ultimate goal, 
which is maximum access to drugs at the low- 
est possible prices,” he says. 

But critics say that the fund’s plan would let 
companies set prices that are affordable only 
for the wealthiest people in these areas. They 
point to examples such as the tiered pricing 
for sofosbuvir, a breakthrough hepatitis C drug 
marketed by Gilead Sciences. Gilead, based in 
Foster City, California, charges US$900 per
treatment course in some low- and middle-
income nations, such as Egypt, Mozam- 
bique, Kenya and Myanmar (see Nature 508, 
295-296; 2014). This is a steep discount from 
the $84,000 that the company charges in the 
United States — where private insurers can 
absorb some of the cost. But the tiered price is 
more than the gross domestic product (GDP) 
per capita in Mozambique, about the same as 
the GDP per capita in Kenya and about one- 
third of that in Egypt. 

“In middle-income countries with high 
income inequalities, companies make more 
money by selling to the rich people than they 
do by selling to the 90% of people who are 
poor,” says Brook Baker, a senior policy analyst
for the Health Global Access Project, a non- 
profit organization based in New York City. 

Many would rather see the Global Fund con- 
tribute to or support existing access initiatives. 
This week’s annual meeting of the World Health 
Organization in Geneva, for instance, includes 
discussions of four projects that aim to test new 
ways to develop affordable medicines. 

The message may be getting through. The 
latest drafts of the Global Fund’s proposal have 
backtracked: a version released to the fund’s 
board on 15 May says that mentions of tiered 
pricing were made “in error”. Yet its list of solu- 
tions to rising drug costs puts “economic clas- 
sification of countries” first — leaving open the 
possibility that prices might be set according to 
a country’s wealth. Middle-income countries 
could still be made to pay prices that are unaf- 
fordable for many of their citizens. 

This alarms global-health researchers such 
as Suerie Moon at the Harvard School of Public 
Health in Boston, Massachusetts. “We have ten 
years of experience that shows that tiered pric- 
ing is going to lead to higher pricing,” she says. 

One study found that the US President's 
Emergency Plan for AIDS Relief, the world’s 
largest single-country medical-aid programme, 
saved $323 million between 2005 and 2008 by 
buying generic HIV medicines instead of tiered- 
priced drugs (C. B. Holmes et al. J. Am. Med. 
Assoc. 304, 313-320; 2010). 

Moon is also worried about the EAI’s potentially
far-reaching impacts. Currently, companies
panies negotiate tiered prices with individual 
countries and other buyers, but a systematic 
global pricing scheme might force more coun- 
tries to pay higher prices. 

“When you have multiple agencies coming 
together to put in place a system of rules and 
policies that have not worked well in the past 
ten years, I think that’s a real cause for concern,”
says Moon.


CORRECTION 

The Editorial ‘An accident waiting to 
happen’ (Nature 509, 259; 2014) stated 
that 13 workers were exposed to levels of 
contamination. In fact, 21 were exposed. 




THE POWER OF THREE


Techniques that transfer DNA from diseased human eggs 
to healthy ones — creating offspring with three biological 
parents — are on the verge of clinical use. 


BY EWEN CALLAWAY 


Douglass Turnbull spends much of his time seeing 
patients who have untreatable, often fatal, diseases. But 
the neurologist has rarely felt more helpless than when 
he met Sharon Bernardi and her young son Edward. 
Bernardi had lost three children within hours of 
birth, owing to a mysterious build-up of acid in their 
blood. So it was a huge relief when Edward seemed to 
develop normally. “He did all his milestones: he sat up, he crawled and 
started to walk at 14 months,” Bernardi recalls. But when he was about 
two years old, he began to fall over after taking a few steps; he eventually 
started having seizures. In 1994, when Edward was four, he was diagnosed 
with Leigh's disease, a condition that affects the central nervous system. Doctors 
told Sharon that her son would be lucky to reach his fifth birthday. 

Turnbull, who works at Newcastle University, UK, remembers 
despairing that “whatever we do, we're never going to be able to help 
families like that”. His frustration sparked a quest to develop assisted- 
reproduction techniques to prevent disorders such as Leigh's disease, 
which are caused when children inherit devastating mutations in their 
mitochondria, the cell’s energy-making structures. 

The procedures — sometimes called three-person in vitro fertiliza- 
tion (IVF) — involve transferring nuclear genetic material from the egg 
of a woman with mutant mitochondria into another woman's healthy 
egg. Turnbull and others have tested the techniques in mice, monkeys 
and human egg cells in culture; now, they say, it is time to try them in 
people. The UK Parliament is set to vote on the issue later this year; if 
legislation passes, the country would be the first to allow this kind of 
genetic modification of unborn children. 

But some scientists have raised concerns over the safety of the pro- 
cedures, and an increasingly vocal coalition of activists, ethicists and 
politicians argues that a ‘yes’ vote will lead down a slippery slope to 
designer babies. US regulators and scientists are closely watching the 
debate as they consider allowing similar procedures. “I admire what 
they've done in Britain,” says Dieter Egli, a stem-cell scientist at the New 
York Stem Cell Foundation, a non-profit research institute. “I think they 
are far ahead in discussion of this, compared to the US.” 


A FATAL INHERITANCE 

The mitochondrion, according to one popular theory, was once a free- 
living bacterium that became trapped in a host cell, where it boosted the 
cell’s capacity to generate the energy-carrying molecule ATP. As a result, 
each mitochondrion has its own genome — but it no longer has all the 
genes it needs to function independently (the human mitochondrial 
genome, for example, has a paltry 37 genes). 

Unlike the genome in the cell nucleus, which includes chromosomes 
from both parents, all of a person’s mitochondria derive from the thou- 
sands contained in the mother’s egg. For reasons still being studied, the 
mitochondrial genome is much less stable than the nuclear genome, 
accruing random DNA mutations about 1,000 times faster. As many as 
1 in 5,000 children are born with diseases caused by these mutations, 
which affect power-hungry cells such as those in the brain and muscles. 
The severity of the conditions depends on the proportion of diseased 
mitochondria a mother passes on to her children. 

Turnbull first got interested in mitochondrial disease and energy 
metabolism in the late 1970s, when he was working as a junior doctor 
on a neurology ward. A member of the Royal Air Force arrived at his 
clinic with a mysterious ailment: whenever he went on training runs, his 
muscles would suddenly give out and force him to stop. Turnbull at first 
suspected that the airman had a mitochondrial disease — and although 
he turned out to be wrong, his curiosity was piqued. 

Turnbull found that the treatment options for mitochondrial dis- 
eases were limited to managing symptoms, for example by prescribing 
anticonvulsant drugs to ward off seizures, rather than addressing the 
underlying biological problem. “You see them develop a mitochondrial 
disease and there's bugger all you can do about it,” he says. The young 
neurologist went on to do a PhD on the inner workings of mitochondria, 
and has devoted his career to understanding how they malfunction. 

After Turnbull met the Bernardis in the mid-1990s, a muscle biopsy 
confirmed that Sharon carried mutant mitochondria. “He couldn't believe 
I looked so well,” she says. The diagnosis helped Sharon to understand 
some of her health problems — and her family’s. Her mother, it turned 
out, had lost several children, and was experiencing heart difficulties in 
her fifties; a cousin and other family members had also lost children. “It’s 
been a family wiped out,” says Bernardi, who lost three more babies after 
Edward was born. Her tragedy spurred Turnbull to seek ways to keep 
children from inheriting their mothers’ mutant mitochondria. 

Others had been thinking along similar lines. In the 1980s, embryolo- 
gists working with mice had begun using ‘pronuclear transfer’ techniques 
to investigate the developmental role of egg cells’ cytoplasm. The procedures 
involve moving nuclear DNA from one fertilized egg to another, 
leaving in place most of the other contents, including the mitochondria. In 
1995, researchers raised the idea that similar procedures could interrupt 
the transmission of mitochondrial diseases in human eggs. 

Turnbull’s laboratory began replicating the mouse research in the early 
2000s, aiming to move quickly to human eggs. Working with Mary Herbert 
and Alison Murdoch, reproductive biologists at Newcastle University 
and an affiliated fertility clinic that provides IVF, they planned to 
start with eggs that had not been fertilized correctly and had no hope of 
generating a fetus. 

It took 18 months to convince regulators to allow the first experiments. 


“THIS ISN’T ‘DESIGNER BABIES’. THIS 
IS ABOUT PREVENTING SERIOUS, LIFE-
THREATENING, DISABLING DISEASES.” 


The UK Human Fertilisation and Embryology Authority (HFEA) twice 
denied the team’s application, on the grounds that the procedures would 
alter the “genetic structure” of the egg — illegal under the 1990 Human 
Fertilisation and Embryology Act, which had established the agency. In 
response, the researchers argued that the term was vague and did not 
apply to pronuclear transfers. They filed a third appeal, this time with 
lawyers to argue their case, and won approval in 2005. 

Around the same time, the UK Parliament began updating the 1990 
law. Revised legislation came into force in 2009 and prohibited clinical 
application of pronuclear transfers — but it allowed for the topic to be 
revisited by Parliament without passing entirely new laws, pending a 
full airing of the scientific, regulatory and ethical issues. The law change 
gave the Newcastle team hope that its experiments, if successful, could 
one day be translated to the clinic. 


A STEADY HAND 

Human egg cells are one-tenth of a millimetre wide, and pronuclear 
transfers must be done under a microscope, in a specially designed 
chamber that controls temperature and air flow. It takes an expert 
embryologist with a steady hand: “People don’t breathe when they’re 
doing this,” says Herbert. 

First, a fertilized egg cell is zapped with a laser, making a hole in its 
membrane. Then the embryologist eases a pipette into the hole and 
plucks out the pronuclei, twin genetic structures that result from ferti- 
lization. Next, the researcher empties a fertilized donor egg of its genetic 
material and squirts the pronuclei into the hollow egg. The feat takes 
several minutes (see video at go.nature.com/ufatcq). If the United King- 
dom approves clinical use of the procedure, the egg would then be incu- 
bated for a few days until it develops into a blastocyst of between 50 and 
200 cells, which would then be transplanted into a woman's uterus. 

In a paper published in May 2010, the Newcastle researchers showed 
that the abnormally fertilized eggs they had been using could undergo 
pronuclear transfer and then develop in culture almost as well as 
untouched egg cells. Crucially, the transferred pronuclei brought few 
mitochondria with them, suggesting that a resulting embryo would 
largely be free of any disease-causing mutant mitochondria. 

But many questions remained. Could the transfers be done efficiently 
enough that a woman could hope to become pregnant? Did they cause 
subtle molecular or genetic changes that might hinder further develop- 
ment or cause health problems after birth? And would the UK govern- 
ment ever allow them to reach the clinic? 

The Newcastle team has spent the past few years looking for answers, 
optimizing its technique in healthy human eggs. “We're reasonably 
comfortable there's a chance of pregnancy with this,” says Herbert. The 
still-unpublished experiments have proceeded slowly, partly because 
healthy human eggs for experimentation are hard to come by. But Herbert 
says that the group has already performed more than 100 pronuclear 
transfers on such eggs. It also hopes to conduct safety studies to assess 
whether the procedures alter the transferred genome or epigenome. But 
such checks cannot provide complete reassurance before the leap into 
humans, the researchers acknowledge. “We can never say for sure that 
it’s 100% safe,” says Herbert. “It has to, at some point, go to treatment.” 


AN UNPLANNED EXPERIMENT 

Mitochondrial-transfer pioneers 

It began as a way to help a handful of patients 
to have babies. But fertility specialist Jacques 
Cohen, then at Saint Barnabas Medical Center in 
Livingston, New Jersey, inadvertently launched 
an experiment that could reveal whether 
mitochondrial-replacement therapies are safe to try in 
humans. 

In the mid-1990s, Cohen was struggling to help a small 
number of women who were unable to conceive, although 
they could make enough eggs for in vitro fertilization. The 
women were not old, but their eggs were a mess — the 
cytoplasm around the nucleus was fragmented and littered 
with debris. 

Cohen wondered what would happen if he added a little 
cytoplasm from another woman’s healthy egg. He tried it 
in mice and it worked; so in 1997, he and his team began 
testing the cytoplasm-transfer technique in humans. They 
painstakingly ‘normalized’ the eggs of 33 infertile women 
with less than a picolitre (10⁻¹² litres) of another woman’s 
egg cytoplasm. Seventeen babies were born as a result of the 
procedure. 

Cohen knew that the transplanted cytoplasm probably 
contained the cellular battery packs known as mitochondria, 
which he had hoped would enhance embryo development. 
Tests reported in 2001 confirmed that at least two babies 
had mitochondria (which each carry 37 genes) from both 
their mother and the cytoplasm donor. The team was the 
first in the world to alter a human’s genetic inheritance in 
this way. 

The health implications for the children are unclear: 
studies suggest that mice with such mixed mitochondria 
develop hypertension and obesity in middle age, and have 
impaired cognition — they escape from mazes more slowly 
than normal mice, for example. One of the babies born 
as a result of the procedure was diagnosed with an autism 
spectrum disorder, and two further fetuses had a genetic 
defect known as Turner syndrome (one was miscarried, the 
other aborted). 

The team stopped performing the procedure in 2001, when 
the US Food and Drug Administration said that more research 
was required before it could be used in humans. But nobody 
followed up on the 17 children, who are now teenagers. 

Cohen, now lab director of Reprogenetics, a 
pre-implantation genetic-diagnostics company in Livingston, 
wants to change that. He has teamed up with researchers at 
Saint Barnabas for a two-phase follow-up study, including 
phone surveys with the families and saliva tests of the 
teenagers, if they are willing. The saliva will show whether 
the teenagers’ mitochondrial genes come from both their 
mothers and the cytoplasm donors. 

Serena Chen, a reproductive endocrinologist who 
joined Cohen’s team in 1999 and is the project’s principal 
investigator at Saint Barnabas, says that the timing is right 
for the study. Its results could make a crucial contribution to 
US and UK debates over related techniques aimed at helping 
women with mitochondrial disease to give birth to healthy 
babies. 

“We feel like this is something that would be helpful for 
the other researchers in this area looking at mitochondrial 
disease, to provide some reassuring data that human 
research into this area is not unreasonable to consider,” she 
says. 

Karen Weintraub 


BATTLE LINES 

In 2010, the Newcastle researchers asked the UK government to consider 
changing the law that prohibits them from conducting their mitochon- 
drial-replacement procedure in humans. The request prompted a flurry 
of hearings, consultations and reports, involving independent scientists, 
bioethicists, regulators, the general public and others; another scientific 
review is expected in the next few weeks. But the protracted process has 
thrown up no major roadblocks. 


“I THINK WHAT EVERYBODY WAS A 
LITTLE BIT UNCOMFORTABLE WITH 
WAS JUST HOW MUCH IS NOT KNOWN.” 


Nancy Lee, a senior policy adviser at the Wellcome Trust, the United 
Kingdom's largest biomedical-research charity, praises the review as 
“a good example of evidence-based policy-making and informing the 
public as much as is possible”. The London-based charity has funded 
Turnbull’s team to the tune of £4.4 million (US$7.4 million), and has 
thrown its considerable political clout behind changing the law. 

Yet some scientists argue that the procedures have not been vetted 
rigorously enough. Klaus Reinhardt, an evolutionary biologist at the 
University of Tübingen in Germany, worries about incompatibilities 
between the nuclear and mitochondrial genomes in individuals con- 
ceived using the procedures. Both nuclear and mitochondrial genes are 
needed for mitochondria to function, and it is likely that gene variants in 
both structures have evolved together, he says. Mitochondrial replace- 
ment in mice, fruit flies and other organisms has occasionally resulted 
in problems with respiration, cognition and fertility, several studies 
have found. Reinhardt, who has expressed his concerns to the panel 
in charge of reviewing the science, questions whether there are enough 
safety data to go forward with clinical trials. “I don’t really know how 
robust everything is,” he says. In response, Turnbull’s team casts doubt 
on the relevance of mitochondrial-replacement experiments that use 
inbred lab animals, and points out that other studies of mitochondrial 
replacement in mice failed to find health problems. 

Some critics use more emotive language. In a March Parliamentary 
debate and a column in The Daily Telegraph, Conservative Member of 
Parliament Jacob Rees-Mogg equated mitochondrial replacement with 
cloning, and said that the techniques would promote eugenics. “In a 
country nervous about genetically modified crops, we are making the 
foolhardy move to genetically modified babies,” he said in the debate. 

An international coalition of several dozen scholars and bioethicists, 
many at religious institutions, expressed similar sentiments in March 
2013 in a letter to The Times newspaper, arguing that mitochondrial 
replacement “would open the door to further genetic alterations of 
human beings with unforeseeable consequences”. 

To counter this opposition, Turnbull and other supporters point out that the 
techniques will be used only to prevent serious mitochondrial diseases. 
The researchers have made patients’ stories, such as the plight of the Bernardi 
family, central to their appeals. They have compared mitochondrial replacement to changing 
the batteries in a camera (a poor analogy, some other scientists say), 
and they argue that mitochondrial DNA makes up a tiny fraction of the 
overall genome, with little influence over a person's defining traits. “This is 
not a slippery slope, in my view,” Turnbull says. “This isn’t ‘designer babies’. 
This is about preventing serious, life-threatening, disabling diseases.” 


MONKEY TRIAL 

A similar debate is shaping up across the Atlantic. While Turnbull and his 
team were developing their pronuclear-transfer technique in human egg 
cells, a US team was testing a related method in monkeys. In 2009, repro- 
ductive biologist Shoukhrat Mitalipov at the Oregon Health and Science 
University in Beaverton and his colleagues reported the birth of two 
healthy rhesus macaques whose mitochondria and nuclei had come from 
different egg cells. The monkey twins — named Mito and Tracker, after 
a reagent used to make mitochondria glow — were conceived through a 
method called maternal spindle transfer (see ‘Genome transplant’). This 
involves shuttling an egg’s nuclear genetic material to an empty donor 
egg before fertilization, rather than after as in pronuclear transfer. There 
have not yet been any side-by-side experiments to compare the merits of 
the two techniques, although both teams are keen to try. 

Mitalipov’s team has used maternal spindle transfer to conceive five 
monkeys, including one from a previously frozen egg (to mimic a likely 
clinical situation). Mito, Tracker and two others born in 2009 have cel- 
ebrated their fifth birthdays, and are still healthy. Mitalipov plans to breed 
them soon to determine their fertility. His team has also proved its tech- 
nique in human eggs: the embryos formed blastocysts, albeit at a low rate, 
and produced embryonic stem cells with the potential to give rise to all 
the body’s different tissues. In unpublished work, the researchers have 
since drastically improved the efficiency of the procedure, he says. “Now 
we want to transplant these embryos.” 

First, he will need approval from the US Food and Drug Administra- 
tion (FDA). The agency has required researchers to seek permission for 
mitochondrial transfers since 2001, after a New Jersey fertility clinic 
carried out dozens of procedures that involved moving small amounts 
of cytoplasm — including some mitochondria — between human eggs 
to improve conception rates (see ‘An unplanned experiment’). 

Mitalipov last year put in a proposal to carry out clinical trials in 
humans, and an FDA advisory panel met in February to discuss the issue. 
The committee spent two days chewing over the same questions that the 
United Kingdom has been grappling with, such as how to establish the 
safety and effectiveness of the procedures in cells and animal models, and 
what the first patient trials might look like. Committee chair Evan Snyder, 
a stem-cell biologist at Sanford-Burnham Medical Research Institute in 
La Jolla, California, says that most of his colleagues are disposed to take 
cellular therapies to patients, and that they recognize the potential to 
unshackle families from the consequences of mitochondrial mutations. 
But, he says, “I think what everybody was a little bit uncomfortable with 
was just how much is not known”. Some panel members wanted to see 
multiple generations of monkeys born healthy using the procedures, as 
well as more safety work on human eggs. 

Mitalipov found the meeting frustrating: “I don’t want to go back 
and do another decade or two decades of research, which we can do. 
But meanwhile, there will be thousands and thousands of children born 
every year that will suffer.” He says he would consider moving his lab 
to Britain to help bring his research to patients more quickly. Snyder, 
however, senses that his committee is not far from green-lighting clinical 
trials, and that safety hurdles could be surmounted in two or three years. 

Back in the United Kingdom, the legislation to allow mitochondrial 
replacement is still being hammered out — a consultation of the draft law 
finishes on 21 May — but proponents are quietly confident that Parlia- 
ment will say yes. The move has support across the political spectrum, 
and most of the scientific and ethical advice given to the government has 
been encouraging. However, a law change would merely give the HFEA 
the power to allow the procedures, and the agency would probably want 
more safety and effectiveness data before it approved any trials. 

GENOME TRANSPLANT 

Two different techniques could be used to prevent children 
from inheriting their mothers’ mutant mitochondria: 
pronuclear transfer and maternal spindle transfer. 

Bernardi hopes that clinical trials will eventually go ahead. But “I 
think it would be bittersweet if somebody had a baby” conceived with 
the procedures, she adds. Her son Edward lived well beyond the expec- 
tations, although he was eventually confined to a wheelchair and his 
health worsened in waves as doctors struggled to find medications 
to quell symptoms including spasms that rendered his arms stiff and 
immobile. Bernardi strove to give him a normal life: he attended school, 
went on class trips and developed crushes. “He liked his girls, he did,” 
she says. Bernardi resisted feeding him through a tube until he was 
unable to eat normally, at the age of 20. “Up until the last ten weeks, I 
would say he had a very good quality of life,” she says. 

Edward Bernardi died in March 2011 after a 21-year struggle with 
Leigh's disease. “I don't think this would benefit me,” says Bernardi of 
the procedures that may be on the cusp of helping other women with 
mitochondrial disease. “But this keeps Edward’s legacy.” 


Ewen Callaway is a senior reporter for Nature in London. 
Carving up the Amazon 


A rash of road construction is causing 
widespread change in the world’s 
largest tropical forest — with potentially 
global consequences. 


BY BARBARA FRASER 


Next to a newly paved highway 
in the Peruvian Amazon, a 
discreet white-on-green sign 
urges travellers to protect the 
surrounding ecosystem. “Let's 
care for the environment, let’s 
conserve the forest,” it reads. But the appeal 
comes too late for this spot in the region known 
as Madre de Dios. Before the route was paved a 
few years ago, tall trees lined the roadside, but 
the forest edge here now lies about half a kilo- 
metre away, beyond a jumble of underbrush 
and freshly cut trees where a cattle pasture was 
recently carved out of the woods. 

As drivers head east and enter Brazil, the view 
is much the same for hundreds of kilometres. 
Such is the impact of the Interoceanic Highway, 
a route some 5,500 kilometres long that cuts 
clear across South America. 

The highway is just one strand in a web of 
roads that now criss-cross the Amazon. So far, 
most have encroached on forest around the 
edges of the basin, but they are increasingly 
slicing through the middle. In Brazil alone, the 
Amazon road system grew by an average of 
almost 17,000 kilometres a year between 2004 
and 2007 (ref. 1). Across the basin, estimates 
for the total length of roads vary widely from 
about 100,000 to 190,000 kilometres of paved 
and dirt roads cutting through the Amazon. 

Once construction begins, road crews are 
quickly followed by land speculators, loggers, 
farmers, ranchers, gold miners and others who 
carve away the forest along the route. That activ- 
ity leaves obvious scars on the landscape in the 
form of treeless expanses, but research is now 
showing that the building of roads also trig- 
gers a cascade of environmental changes in the 
remaining forest that can dry out trees, set the 
stage for wildfires and weaken the ecosystem. 

“Put a road into a frontier area and it opens 
a Pandora’s box,” says biologist William 
Laurance of the Centre for Tropical Environ- 
mental & Sustainability Science at James Cook 
University in Cairns, Australia. 

The drying brought about by roads influ- 
ences local atmospheric circulation patterns 
and can have farther-reaching effects that not 
only compromise the health of the Amazon but 
can also contribute to global warming by releas- 
ing carbon stored in the forest. Understanding 
those details is crucial, researchers say, for 
determining whether these effects — combined 
with severe droughts such as those that struck 
parts of the Amazon basin in 2005, 2007 and 
2010 — could tip the world’s largest expanse 
of tropical forest from being a net absorber of 
carbon dioxide to a net emitter. 


THE FIRST CUT 

It was a road that kicked off the pattern of 
destruction in the Amazon forest. In the 1970s, 
Brazil began building the Trans-Amazonian 
Highway from near the country’s easternmost 
point on the Atlantic coast to its western bor- 
der, where the state of Amazonas meets Peru. 
The route opened up the heart of the Amazon 
to logging, ranching and settlement, causing 
deforestation rates to soar. Extreme spells in the 
1990s and early 2000s claimed more than 25,000 
square kilometres a year — an area bigger than 
New Jersey. Since 2005, government measures, 
including crackdowns on illegal logging, have 
slowed forest loss. Throughout, roads have 
provided the means to penetrate the forest and 
erase large chunks of it. In an unpublished study 
of the Brazilian Amazon, Christopher Barber, 
a researcher at South Dakota State University 
in Brookings, found that 95% of deforestation 
in the region occurs within 7 kilometres of a 
road. And that is not the only problem: just as 
serious as outright deforestation is fragmenta- 
tion, which happens when loggers, ranchers and 
farmers move in. In Brazil, up to 38,000 kilometres 
of new forest edge are created each year. 

Standing in a field in the western Brazilian 
state of Mato Grosso, Michael Coe can feel 
the difference that deforestation makes in the 
Amazon. An atmospheric scientist who heads 
the Amazon programme of the Woods Hole 
Research Center in Falmouth, Massachusetts, 
Coe is visiting an 80,000-hectare patch of for- 
mer forest that was originally cleared some 
years ago to build a cattle ranch, which later 
morphed into a soya-bean farm. The air is 
noticeably hotter and drier in the field than in 
one of the few patches of forest left on the farm. 
Coe and his colleagues are here to study how 
forest degradation and fires alter the flow of 
water and energy in the Amazon ecosystem. 
Evapotranspiration from trees provides mois- 
ture to the air and feeds much of the precipita- 
tion in the Amazon: when the trees disappear, 
so does a major source of moisture. A study 
using satellite data and models of atmospheric 
circulation suggests that air passing over tropical 
regions rich in vegetation produces at least 
twice as much rain as air moving over areas 
with little vegetation. 

“We’re looking at a tidal wave of road 
expansion happening in the next few decades.” 

Stripping away trees not only eliminates a 
source of moisture; it also changes the regional 
air flow. Heat rising from a bare field creates a 
low-pressure system that pulls in air from the 
surrounding area, sucking moisture out of the 
nearby forest, says Coe. 

As the forest dries, it transfers less moisture 
to the atmosphere, changing rainfall patterns 
hundreds or thousands of kilometres down- 
wind. That could affect not only forests and 
agriculture across the basin, but also the amount 
of water available to power hydroelectric dams. 
In a simulation using climate, hydrological 
and land-use models, Coe and his colleagues 
projected that reductions in rainfall caused by 
deforestation could drastically cut the power-
generating capacity of Amazonian dams. That 
would upset the plans of Brazil, Peru and Ecua- 
dor, which intend to increase hydropower to 
meet rapidly growing electricity demands. 

The drying effect reaches well past the forest's 
edge. And the more fragmented the forest, the 
wider the impact, according to one study that 
found canopy drying 2.7 kilometres from the 
edge of a highly fragmented forest. 

The influence of roads in the Amazon could 
even reach around the world. Recent lines of 
research suggest that changes in several factors 
prevent trees in disrupted forests from storing 
as much carbon as they did in the past, a shift 
that could accelerate global warming. 

Greg Asner, a tropical ecologist at the Car- 
negie Institution for Science at Stanford Uni- 
versity in California, studies the chemistry of 
the tree canopy in the Amazon using ground 
plots and airborne spectrometers. He is find- 
ing that the forest canopy along the edges of 
open patches does not seem to hold as much 
water or pigment, such as chlorophyll, as trees 
in unbroken parts of the forest. “Not enough 
chlorophyll and not enough water keep the 
canopy from soaking up carbon dioxide at the 
rate that we know it can, compared to the more 
interior forest,” he says. 


LINE OF FIRE 

Changes in the Amazon's fire potential are also 
impeding the forest's ability to store carbon. 
Conventional wisdom has long held that the 
rainforest was too humid to burn. But in 2005, 
when drought struck the western Amazon, 
wildfires in the Brazilian state of Acre merged 
into a line 11 kilometres long, with flames leap- 
ing to canopy height, recalls Foster Brown, an 
environmental geochemist at the Woods Hole 
Research Center who witnessed the fires. 
The flames damaged more than a quarter of a
million hectares of forest in that state alone and
caused US$100 million in damages. Smoke
blanketed Rio Branco, the state capital, and 
public-health concerns finally led to ordinances 
to control burning during times of drought. 

Scientists considered the 2005 drought to 
be a once-in-a-century event; some 70 mil-
lion hectares of forest suffered water stress,
and there was significant drying within the 
tree canopy. But five years later, a similar dry 
spell struck, triggering another extreme bout 
of fires. Because they have not evolved in an 
environment frequently beset by fires, trees in 
the Amazon forest are susceptible to heat and 
damage from flames. 

Farther east, in Brazil’s Xingu region, 
researchers saw similar results from experi- 
mental fires during a drought in 2007. Tree 
mortality from heat and fire damage that 
year was more than four times that of a nor-
mal year, especially along the forest edge,
which the researchers burn every three years 
in a cycle emulating traditional Amazonian 
agricultural practices, says ecologist Paulo 
Monteiro Brando of Brazil’s Amazon Envi- 
ronmental Research Institute in Brasilia. In 
the Amazon, burning is the cheapest and most 
effective way for farmers to clear fields and give 
them a nutrient boost before planting crops, or 
rid them of ticks that plague livestock. 

Understanding the future of the Amazon 
means learning how to model not just physi- 
cal and atmospheric processes, but also how 
humans are changing the land, researchers say. 

And as the wider impact of Amazonian roads 
becomes clearer, planners and conservation- 
ists face a dilemma. Although roads threaten 
the forest's health, they also significantly lower 
costs for farmers and businesses, and can make 
a difference between life and death for people 
in remote areas far from hospitals. 

But unrestricted road building could lead to 
irreparable environmental harm, say research- 
ers. “We're looking at a tidal wave of road 
expansion happening in the next few decades,” 
Laurance says. “It’s ecological Armageddon, 
and it’s happening again and again.”


The Interoceanic 
Highway has spurred 
deforestation in the 
Peruvian Amazon. 


Barbara Fraser is a freelance writer in Lima, 
Peru. 


A road map for 
suicide research 
and prevention 


It is time for policy-makers, funders, researchers 
and clinicians to tackle high suicide rates, say 
André Aleman and Damiaan Denys. 


© 2014 Macmillan Publishers Limited. All rights reserved 


According to the World Health Organ-
ization, almost 1 million people kill
themselves every year. That is more
than the number that die in homicides and 
war combined. A further 10 million to 
20 million people attempt it. 

Suicide is one of the three leading 
causes of death in the economically most 
productive age group — those aged 
15-44 years — and rates have risen since 
the economic crisis triggered by the banking 
crash in 2008 (see ‘Suicide rates in Europe’).
For example, the number of suicides per year 
in the Netherlands rose by 30% between 2008 
and 2012, from 1,353 to 1,753. In the United 
States, the average suicide costs society 
US$1.06 million according to the US Centers 
for Disease Control and Prevention. 

Despite its enormous societal impact, 
little progress has been made in the scien- 
tific understanding or treatment of suicidal 
behaviour. We do know that up to 90% of 
suicides occur in people with a clinically 
diagnosable psychiatric disorder. Large
epidemiological studies have shown mental 
disorders, particularly depression and alco- 
hol addiction, to be major risk factors. And 
there is compelling evidence that adequate 
prevention and treatment of such disorders 
can reduce suicide rates.

But psychiatry has long neglected the 
topic. Other than as symptoms of border- 
line personality disorder and mood disor- 
ders, suicide, suicide attempts and suicidal 
thoughts were not listed in the fourth edition 
of the Diagnostic and Statistical Manual of 
Mental Disorders (DSM-IV). The DSM-5 
(published last year) does not code suicidal 
behaviour — the most prominent emer- 
gency in psychiatry in primary care. Suici- 
dality is perceived as a medical complication 
rather than as a disorder in its own right. 

For every study on suicidality published in 
the two highest-ranking general psychiatry 
journals (American Journal of Psychiatry and 
JAMA Psychiatry) over the past five years, 
there were six papers on schizophrenia, the 
incidence of which is one-quarter that of 
suicidal behaviour. And, in contrast to the 
studies on schizophrenia, those on suicidal 
behaviour are mostly epidemiological and 
do not investigate underlying mechanisms. 

The lack of suicide research may be due to 
several factors. The first is cultural taboos. 
People are hesitant to talk about the suicide 
of a family member or friend, and many
religions consider suicide dishonourable. It is 
often deemed unlawful, too. In India and Sin- 
gapore, for example, attempted suicide is pun- 
ishable by up to a year in prison. In several US 
states, suicide is still considered an unwritten 
‘common law’ crime, which can have financial 
consequences for the family. Assisted suicide 
is illegal in many countries. 

Second, the triggers of suicide are com- 
plicated, involving mental-health, financial, 
social, cultural and moral issues. Third, sui- 
cidal behaviour could be difficult to study if 
non-fatal suicidal attempts differ in aetiol- 
ogy from fatal ones. 

What is urgently needed is a road map 
for the systematic study of the mechanisms 
of suicidal behaviour, independent of any 
associated disorder. Only then can evidence- 
based prevention programmes be framed. 

In our view, these four steps are needed. 


FOUR-POINT PLAN 

Define suicide as a distinct disorder. 
Treatment of the mental disorders com- 
monly associated with suicide, such as 
depression, fails to prevent suicidal behav- 
iour in most people. Although suicidality 
touches different medical and psychosocial 
disciplines, it should come under the remit 
of psychiatry, which being at the crossroads 
of mind and brain is well positioned to 
appraise all dimensions. Psychiatry should 
take responsibility for defining suicidality 
adequately, incorporating it in classification 
systems, developing ratings scales to predict 
and assess severity and examining treatment 
options. This will make suicidal behaviour 
visible as a mental disorder. 


SUICIDE RATES IN EUROPE
In many nations, deaths from suicide increased after the economic crisis
triggered by the banking crash in 2008. (Figure: deaths from suicide, 2010 data.)


Understand the mechanisms. The roots of
suicide — psychological and neurobiological
— could stem from difficulties in emotion
regulation and the underlying brain circuits.
The most important psychological correlates 
are anxiety, reduced impulse control and 
increased aggression. Furthermore, people
showing suicidal behaviour tend to suppress
their emotions and have difficulties identify-
ing their feelings. Suicidality is associated
with hopelessness, sensitivity towards social
disapproval and a reduced ability to imagine
positive future events. But there are probably
several routes to suicidal behaviour. 
Research should focus on individual differ- 
ences in cognitive control of emotion. Some 
people may have excessively strong emo- 
tional reactions to challenging events such as 
bereavement or job loss, some may lack cog- 
nitive flexibility and coping skills in the face 
of adversity, and some may show impulsive 
aggressive tendencies. Widely accepted mod- 
els of the role of emotional reactivity and lack 
of cognitive control in setting the stage for sui-
cidal behaviour need rigorous investigation.
An example of the sort of research that we
need more of is a neuroimaging study by a
team led by psychiatrist Scott Matthews at the 
VA San Diego Healthcare System in Califor- 
nia. They compared brain activity in combat- 
exposed war veterans considered to be at risk 
of suicide with that in those not deemed to 
be at risk. Both groups had similar levels of 
depression and post-traumatic stress disor- 
der. Members of the suicidal group showed 
stronger activation of the anterior cingu- 
late and prefrontal cortex when they made 
mistakes while performing a concentration 
task. These brain areas are involved in cogni- 
tive control and monitoring of actions. The 
authors hypothesize that the extra effort used 
to process errors during self-monitoring may 
represent a vulnerability that could dent peo- 
ple’s ability to cope with stress. This deserves 
investigation in larger samples, taking into 
account different emotional states. 


Fund suicide research. Governments and 
funding agencies should invest more in the 
topic. The Horizon 2020 European Union 
Framework Programme for Research and 
Innovation should include a challenge 
devoted to suicide research — most urgently, 
the definition of valid criteria for suicidal 
behaviour as a mental disorder and the inves- 
tigation of putative abnormalities in emo- 
tion-regulation circuits associated with such 
behaviour. The Societal Challenges presently 
defined in the framework do not mention 
suicide. The US National Institute of Mental 
Health in Bethesda, Maryland, has requested 
applications to develop ways to screen ado- 
lescents for suicide risk, but larger and more 
comprehensive programmes are necessary. 

Promising in this regard is the institute's 
Research Domain Criteria project, which 
offers funding for the development of ways 
to classify psychopathology according to 
observable behaviour and neurobiological 
measures. For example, apathy is frequently 
observed in psychiatric and neurological 
disorders such as depression, schizophrenia, 
Parkinson's disease and Alzheimer’s disease. 
Because the mechanisms leading to apathy 
may well be the same across these disorders, 
apathy is increasingly being studied as a dis- 
tinct syndrome, irrespective of whether the 
patient has other symptoms of psychiatric 
or neurological disorders. Because suicide 
is a risk in various mental disorders, and 
encompasses neurobiological as well as 
social aspects, it is similarly suitable as a topic
within this project. 


PREVENTION PAYS
Economic modelling predicts that in one year, the costs of training UK general practitioners in suicide
prevention, and the ensuing costs of psychological and pharmaceutical therapy they may prescribe to
at-risk people, are outweighed by the savings to the public purse owing to the roughly 600 deaths averted.
Chart items: Suicide-awareness training −£8.1m; Suicide-prevention measures −£1.8m;
Emergency treatment; Police/coroner costs; Funerals.


Promote prevention. Governments should
invest as much in suicide prevention as
they do in reducing fatal road accidents.
In 2008-09, UK spending on road-safety
awareness, including television advertise-
ments, topped £19 million (US$32 million);
by contrast, £1.5 million was invested over
three years in suicide research. Fatal road
accidents have declined steadily over the past 
decades, whereas suicide rates have levelled 
or even increased. Suicide awareness and pre- 
vention was highlighted in a review published 
earlier this year, which concluded that there 
is a return on investment for several mental- 
health promotion and illness-prevention 
interventions (see ‘Prevention pays’).

Risk factors are known from epidemio-
logical studies, which should help to shape
developing programmes aimed at preven-
tion. Notable risk factors are mental
disorder, previous suicide attempt,
anxiety, impulsivity in combination
with aggressive tendencies, family
history of suicide
and stressful life events such as job loss or
divorce. Comprehensive prevention pro-
grammes should be developed that incorpo-
rate state-of-the-art knowledge.

A good prevention programme would 
increase awareness and mental-health literacy 
in the general population to improve people’s
understanding of warning signs. Better edu- 
cation is also important for general practition- 
ers (GPs), because many people with suicidal 
thoughts contact their GP in the weeks before 
attempting suicide. Prevention programmes 
would offer clear and easy access points for 
help, and a monitoring service for those at 
risk. Programmes must enlist governments 
and other stakeholders to tackle stigma, a 
major obstacle to suicide prevention. 

Few countries or institutions have systematically implemented
such programmes. Examples include Fin- 
land, Scotland and the US military. Their 
efforts should now be evaluated to pave the 
way for evidence-based improvements. 

Coordinated efforts are clearly needed 
from public-health authorities, clinicians 
and scientists to understand and prevent sui- 
cide. Researchers should take advantage of 
progress in neurobiology and neuroimaging
technology to uncover the brain mecha- 
nisms involved. Clinicians must focus on 
suicidal behaviour as a target of treatment 
in its own right. As Australia’s National Men-
tal Health Commission put it: “We can and
must do better.”


André Aleman is professor of cognitive 
neuropsychiatry at the University of 
Groningen, the Netherlands. Damiaan 
Denys is professor of psychiatry at the 
University of Amsterdam, and is at the 
Netherlands Institute for Neuroscience, 
Amsterdam, the Netherlands. 
CORRECTION

The Comment ‘Realizing China’s 
urban dream’ (Nature 509, 158-160; 
2014) wrongly stated that more than 
75.8 million hectares of arable land in 
China could be realized by optimizing 
rural residential land use. The correct 
figure is 7.58 million hectares. 
EVOLUTIONARY BIOLOGY


Women’s advocates (left to right) Elizabeth Cady Stanton, Antoinette Brown Blackwell and Maria Mitchell. 


Darwin and 
the women 


Sarah S. Richardson relishes a study of how nineteenth- 
century US feminists used the biologist’s ideas. 


Two misplaced narratives dominate
thinking on the historical relationship 
between feminism and evolutionary 
biology. The first is that nineteenth-century 
Darwinists presented a chorus of sexist views 
of women. The second is that feminism and 
evolutionary biology are wholly independent 
intellectual movements. In From Eve to Evo- 
lution, historian Kimberly Hamlin counters 
these misconceptions with the most compre- 
hensive account so far of how nineteenth- 
century US men and women appropriated 
Darwinian ideas to argue for the equality of 
the sexes in the domestic and public spheres. 
The US women’s movement gathered 
fresh energy in the decades following the 
end of the civil war in 1865, launching calls 
for women’s suffrage, access to property 
rights and education, and the freedom to 
divorce. Hamlin shows how prominent 
women’s rights advocates enlisted science 
“as a force for positive change”, even when
excluded from lecture halls, as they often 
were. She demonstrates that evolutionary 
science offered US feminists a fresh intel- 
lectual framework from which to challenge 
the biblical dogma that stipulated women’s 
inferiority to men and submissive role in 
domestic life. For social activists such as 
Elizabeth Cady Stanton, evolutionary theory 
“provided a new way for women to view the 
universe and their role in it, and a new lan- 
guage to describe what they saw”. 

From Eve to Evolution: Darwin, Science, and Women’s Rights in Gilded Age America
KIMBERLY A. HAMLIN
University of Chicago Press: 2014.

As feminists rallied to respond to anti-
woman screeds such as Harvard-trained
physician Edward Clarke's 1873 Sex in
Education, which warned that women’s
‘enfeebling’ menstrual cycles legislated
against their participation in higher edu-
cation, they began to articulate the need for
better science relating to women. With lively
examples, Hamlin relates how educated
middle- and upper-class US women of
the era, such as physician Mary Putnam
Jacobi, astronomer Maria Mitchell and 
author and suffragist Helen Hamilton Gar- 
dener, argued for the importance of training 
women in science and for a science of sex 
free of misogynistic bias. Gardener’s brain, 
left to science to prove the equality of male 
and female intellect, still stands on display 
at Cornell University in Ithaca, New York. 
Other US feminists summoned the evi- 
dence of biology to argue for more-egalitarian 
marriage and child-rearing arrangements, 
and for the importance of women’s work out- 
side the home. Antoinette Brown Blackwell, 
the first US woman to be ordained a minister, 
cited Darwin's evidence of male provisioning 
and female extra-reproductive labour in ani- 
mals to argue against the Victorian domestic 
division of the “eight-hour husband” work-
ing outside the home and the “fourteen-hour
wife” within it. Feminist intellectual Charlotte
Perkins Gilman drew on Darwin's sexual-
selection theory to argue that women’s eco-
nomic dependence on men was unnaturally
skewing evolution to promote “excessive sex-
ual distinctions”. She proposed that economic
and reproductive freedom for women would 
restore female autonomy in choice of mate 
— which Darwin posited was universal in 
nature, except in humans — and put human 
evolutionary progress back on track. 

Darwin himself opposed birth control and 
asserted the natural inferiority of human 
females. The adult female, he wrote in The 
Descent of Man (1871), is the “intermediate 
between the child and the man”. Neverthe- 
less, appeals to Darwinist ideas by birth-con- 
trol advocates such as Margaret Sanger led 
one critic to bemoan in 1917 that “Darwin 
was the originator of modern feminism”.

Feminism in the late nineteenth century 
was marked by the racial and class politics 
of the era's reform movements. Blackwell’s 
and Gilman's views that women should work 
outside the home, for example, depended on 
the subjugated labour of lower-class minor- 
ity women to perform household tasks. And 
Sanger’s birth-control politics appealed to 
contemporary fears of race and class ‘sui-
cide’. From Eve to Evolution acknowledges
this legacy, but does not dwell on it. 

For example, Hamlin argues that the 
anti-biological determinist arguments of 
white nineteenth-century feminists are 
more “nuanced and complex” than gener- 
ally appreciated and were a resource “not 
just for white women but for everyone”. On 
Gilman, Hamlin asserts that “at least most 
of the time, Gilman meant the ‘human race’ 
when she wrote the ‘race’ and that racism is 
not the defining characteristic of most of her 
writings”. Although context is clearly cru- 
cial to a careful reading of this complicated 
intellectual history, it is hard to evaluate such 
claims without a more detailed treatment of 
the words and deeds of these feminist writers 
on the matter of race and class. 

Nonetheless, this deeply researched and 
richly detailed picture of US feminism in the 
late nineteenth century and early twentieth 
century is an important contribution to our 
understanding of the interrelation of gen- 
der politics and science. From Eve to Evolu- 
tion firmly corrects the mistaken view that 
evolutionary biology and feminism are at 
odds. And it reveals a more diverse dialogue 
around the science of sexual equality in the 
era than is generally appreciated.


Sarah S. Richardson is a science historian 
specializing in women, gender and sexuality 
at Harvard University in Cambridge, 
Massachusetts. Her latest book is Sex Itself. 
e-mail: srichard@fas.harvard.edu 
Forgotten prophet of the Internet


Philip Ball ponders the tale of a librarian who dreamed of networking information. 


The Internet is considered
a key achievement of the 
computer age. But as for- 
mer New York Times staffer Alex 
Wright shows in the meticulously 
researched Cataloging the World, 
the concept predates digital tech- 
nology. In the late nineteenth cen- 
tury, Belgian librarian Paul Otlet 
conceived schemes to collect, store, 
automatically retrieve and remotely 
distribute all human knowledge. 
His ideas have clear analogies with 
information archiving and net- 
working on the web. Wright makes 
a persuasive case that Otlet — now 
largely forgotten — deserves to
be ranked among the conceptual 
inventors of the Internet. 

Wright locates Otlet’s work in a 
broader narrative about collation 
and cataloguing of information. 
Compendia of knowledge date back 
at least to Pliny the Elder’s Natural 
History (AD 77-79) and the collec- 
tions of Renaissance scholars such as Swiss 
naturalist Conrad Gesner, although these 
were digests of typically uncited sources. Otlet 
sought to collect everything — newspapers, 
books, pamphlets, photographs — and to 
devise a method of categorization that would 
rival the Dewey decimal system. Wright 
tells a poignant story of the elderly, perhaps 
senile, Otlet stacking up jellyfish on a beach 
and then placing on top an index card bearing 
the number 59.33: the code for Coelenterata 
in his Universal Decimal Classification. 

Otlet envisaged a ‘Mundaneum’, a reposi-
tory of all knowledge. Central to the scheme
was the Universal Bibliography, a card index 
with more than 15 million entries in filing 
cabinets. Realizing how much space and 
labour such a system demanded, Otlet advo- 
cated the miniaturization of documents (on 
microfilm) and planned automatic systems 
to locate information, like steampunk search 
engines. This knowledge, he thought, might 
be broadcast to users by radio, and stored in a
workstation called a Mondothèque, equipped
with microfilm reader, telephone, television 
and record player. 

All this can be correlated with the software 
and hardware of today. But Wright recog- 
nizes that the comparison goes only so far. 
Otlet’s vision was consistent with the social 
climate of his day: centralized, highly man- 
aged and hierarchical. It was quite unlike 
the distributed, self-organized peer-to-peer 


Paul Otlet’s Mondothèque workstation.


networks made possible by the personal- 
computer revolution that was shaped by the 
counterculture of the 1960s and 1970s. 

The real focus of this story is not the ante- 
cedents of the Internet. It concerns dreams of 
a utopian world order, shared by many around 
the end of the nineteenth century and after 
the First World War. This was Otlet’s grander 
vision, to which his collecting and catalogu- 
ing were merely instrumental. In 1919, with 
politician Henri La Fontaine — a commit- 
ted internationalist awarded the 1913 Nobel 
Peace Prize — Otlet successfully petitioned 
the Belgian government to fund plans to house 
the collection in a wing of a grand building 
in Brussels. He dubbed this space the Palais 
Mondial. The two men imagined an ‘intellec-
tual parliament for all humanity’, in which the
organization of knowledge would contribute
towards philosopher Auguste Comte’s vision
of a rationally governed society. In part, their
ideas paved the way for the League of Nations
[...] distraught when the Paris Peace Confer-
ence [...] establish the league in Geneva, in neu-
tral Switzerland, rather than Brussels.
But their objective was much more
grandiose, utopian and strange.

Cataloging the World
ALEX WRIGHT
Oxford University Press.

Progressive thinkers such as 
H. G. Wells (whom Otlet read) desired 
world government in the interwar 
period, but Otlet’s plans often seemed 
detached from mundane realities. 
They veered into mystical notions of 
transcendence of the human spirit, 
influenced by theosophy, and Otlet 
seems to have imagined that learn- 
ing could be transmitted not only by 
careful study of documents but by a 
symbolic visual language in posters 
and displays. In the late 1920s, he and 
architect Le Corbusier devised a plan 
to realize the Mundaneum as a build- 
ing complex full of sacred symbolism, 
as much temple as library. Wright 
overlooks the real heritage of these 
ideas: Otlet’s predecessor here was 
not Gesner but Italian philosopher 
Tommaso Campanella, who in 1602 
described a “City of the Sun” in which citizens 
imbibed knowledge from great, complex wall 
paintings. This aspect of Otlet’s dreams looks 
backwards to Neoplatonism and Gnosticism 
as much as it looks forward to the information 
age and the Internet. Perhaps unsurprisingly, 
politicians remained unconvinced, dooming 
Otlet to frustration and ultimate failure. 

The modern ability to access Isaac New-
ton’s Principia online would have delighted
Otlet. That so much more network traffic 
involves cats and pornography would have 
devastated him. He was devastated enough: 
the actual Mundaneum never amounted to 
more than a corner of the building hosting the 
Palais Mondial, and the government edged 
him out of there in 1924 to make room for an 
exhibition on rubber. After losing funding for 
the cataloguing project in 1934, Otlet clung 
to a corner of the premises until the Nazis 
destroyed most of his collection in 1940. 

The remainder mouldered for decades in 
various buildings in Brussels; what survived 
now sits modestly in the new Mundaneum in 
Mons, a former garage. But there is another 
Mundaneum in Brussels: a conference room 
given that name in Google’s European office. 
It is a fitting tribute, and Wright has offered 
another.


Philip Ball is a writer based in London. His 
next book, Invisible, will be out in August. 
e-mail: p.ball@btinternet.com 
Investigate smog to
inform policy 


The severe air-pollution episodes 
that affected Europe this spring (see,
for example, go.nature.com/si6mhu and
go.nature.com/1b7ygf) highlight the need
for effective policy measures and 
management strategies, given the 
persistence and potential global 
reach of such acute events. 

Reducing particulate matter, 
ozone and greenhouse gases is 
essential to mitigate air pollution 
as well as climate change, so 
policies need to be coordinated. 
For example, climate policy 
encourages use of fuels such as 
diesel (because its combustion 
releases less carbon dioxide 
per kilometre than petrol) and 
biomass, which offsets most of its 
own carbon release as it grows. 
However, both fuels generate 
harmful particulates. 

For now, the factors that 
triggered Europe's latest 
air-pollution episodes and 
influenced their duration and 
spread must be investigated. This 
insight will aid the development 
of much-needed cross-border 
policies that are based on sound 
scientific advice. 

Paul S. Monks* University of 
Leicester, UK. 
psm7@leicester.ac.uk 

*On behalf of 11 co-authors (see 
go.nature.com/nxe3pb for full list). 


Soil observatory lets 
researchers dig deep 


The UK Soil Observatory 
(UKSO) launched last month
in London. It provides a global
web-based platform for the 
acquisition and dissemination of 
soil data (www.ukso.org). 

Soil security is rising rapidly 
on government agendas, driven 
by the United Nations Global 
Soil Partnership (see Nature 492, 
186; 2012). Researchers therefore 
need to be able to predict how 
soils will respond to changing 
climate, vegetation, erosion 
and pollution (M. W. Schmidt 
et al. Nature 478, 49-56; 2011). 


The UKSO is a step change in
meeting this challenge. 

Expanding on the success 
of the mySoil crowd-sourcing 
app (see W. Shelley et al. Nature 
496, 300; 2013), the UKSO is a
community initiative funded 
by the Natural Environment 
Research Council. It offers live 
data streaming and pioneers a 
route to big data resources. It is 
comprehensively linked to other 
sources of soil data, both free and 
commercial, and to real-time data 
from the COSMOS national soil- 
moisture network. 

The UKSO interactive map 
viewer provides access to a huge 
range of information, from 
local soil biodiversity to metal 
concentrations in topsoil. 

Data from the UKSO will 
underpin major government 
projects, such as the £160-million 
(US$268-million) strategy 
for agricultural technologies 
(go.nature.com/60qmxh), and 
will catalyse associated industrial 
and commercial enterprises. 
Russell Lawley NERC/British 
Geological Survey, Environmental 
Science Centre, Nottingham, UK. 
Bridget A. Emmett, David A. 
Robinson NERC Centre 
for Ecology and Hydrology, 
Environment Centre Wales,
Bangor, UK.
davi2@ceh.ac.uk


Strict vaccine quality
control in China


We disagree that vaccine quality 
control is a problem in China 
(Nature 507, S12-S13; 2014). In 
2013, for example, the agency 
responsible for the country’s 
vaccine quality control — the 
National Institutes for Food 
and Drug Control (NIFDC) in 
Beijing — was designated as
a World Health Organization
Collaboration Center for 
Standardization and Evaluation 
of Biologicals. 

Since 2002, China has 
implemented batch release of 
vaccines as a key quality-control 
measure to ensure that they 
are safe to use. Each batch is
screened by the NIFDC before
its release. China produces 
about 4,000 batches of vaccines 
annually, corresponding to some 
700 million doses. 

In 2011, eleven batches of 
vaccine were rejected because 
they were of sub-standard 
quality; three were rejected 
in 2012 and four in 2013. 
Together, these represent a 
total of 1 million doses and 
correspond to just 0.05% of 
throughput over three years 
— a mark of China's stringent
vaccine-production processes 
and of the success of batch- 
testing practices. 

Zhenglun Liang, Qunying Mao, 
Junzhi Wang National Institutes 
for Food and Drug Control, 
Beijing, China. 
wangjz@nifdc.org.cn 


Rethink Campania’s 
toxic-waste scandal 


As you point out, it is not easy 
to prove a correlation between 
the health issues of people in 
Italy's Campania region and 
their exposure to toxic waste 
(see Nature 508, 431; 2014). The 
authorities seem to prefer to 
blame lifestyle factors, but that 
does not explain why thousands 
of non-smoking and sober 
sheep in the area have had to be 
removed from the food chain. 
This waste crisis is a result of 
certified disposals as well as of 
illegal dumping, currently being 
investigated by the European 
Union project ENTITLE 
(www.politicalecology.eu); the 
crisis has been associated with 
environmental injustice and a 
disregard for democracy (see 
G. D’Alisa et al. Ecol. Econ. 70, 
239-249; 2010). The area has 
become a political laboratory 
for testing extreme regulations. 
It is in a permanent state of 
emergency, with environmental 
and civil-rights laws suspended. 
Rather than abusing 
emergency decrees in this way, 
a different type of laboratory 
is needed, in which social 
and natural scientists can 


accumulate the facts necessary 
to address the injustices 
embodied in the current 
patterns of contamination and 
exposure. 

Giacomo D’Alisa Autonomous 
University of Barcelona, Spain. 
Marco Armiero Royal Institute of 
Technology, Stockholm, Sweden. 
Salvatore Paolo De Rosa Lund 
University, Sweden. 
giacomo_dalisa@yahoo.it 


Internal factors drive 
Chinese patent surge 


China is outpacing many 
countries in patenting renewable- 
energy technologies (see 

J.E. Trancik Nature 507, 300-302; 
2014). But this achievement has 
arguably been inflated by the 
country’s patenting laws. 

For example, according to 
the Espacenet patent database, 
from 1978 to 2008 China was 
awarded 21,192 patents on solar 
photovoltaics. Of these, 99% 
were granted by China’s State 
Intellectual Property Office. 

By contrast, the US Patent and 
Trademark Office granted just 
83 patents in this field over 
the same period (C.-Y. Wu 
Scientometrics 98, 429-446; 
2014). 

The overwhelmingly 
domestic focus of China's 
patenting activity is a result of 
its patent-law system, which is 
designed to promote indigenous 
innovations. Chinese firms tend 
to file as many patents as possible 
for a single equivalent invention, 
encouraged by extra public 
subsidies and administrative 
support (see, for example, 

A. de la Tour et al. Energy Policy 
39, 761-770; 2011). 

This factor needs to be borne 
in mind when evaluating Chinese 
patent figures for the purposes of 
technological ranking. 

Ching-Yan Wu Fu Jen Catholic 
University, Taiwan. 

Mei-Chih Hu National Tsing 
Hua University, Taiwan. 

John A. Mathews Macquarie 
University, Sydney, Australia. 
mchu@mx.nthu.edu.tw 
OBITUARY 


Adolf Seilacher 


(1925-2014) 


Palaeontologist who pioneered analysis of trace fossils. 


Adolf Seilacher used the simplest of 
techniques — careful observation — 
to transform our understanding of 
ancient organisms. His interpretations of 
the enigmatic Ediacara fossils, which date 
from about 578 million years ago, before the 
appearance of the major animal phyla during 
the Cambrian explosion, explained forms 
found among the earliest large organisms. 

Seilacher also showed how trace fossils — 
those that record biological activity such as 
the burrowing of marine animals — reveal 
behavioural traits. He analysed the influ- 
ences that shape invertebrate morphology 
and showed how exceptionally preserved 
fossil assemblages (for which he used the 
term Lagerstätten) are the result of condi- 
tions such as low oxygen, rapid burial and 
the effect of microbial films that seal the 
sediment surface. 

In a career that straddled the Atlantic, 
Seilacher influenced palaeontology as much 
with his personality as with his publications. 
With a commanding yet engaging presence, 
he had a way of asking questions that caught 
speakers off guard — an occurrence referred 
to as being ‘Dolfed’. 

Seilacher, who died on 26 April aged 89, 
was born in 1925 near Stuttgart in Germany. 
He found his first fossil at the age of 14 and 
published his first paper, on fossil sharks 
from local rocks, at 18. He served in the 
German navy in the last years of the Second 
World War before entering the University of 
Tübingen in 1945. 

There, palaeontologist Friedrich von 
Huene taught him to use the camera lucida, 
an apparatus with a prism and mirror that 
projects the image of a specimen onto a sheet 
of paper so that it can be traced. Over the 
rest of his career, Seilacher made thousands 
of drawings in this way, both as illustrations 
for his publications, and as an aid to under- 
standing fossils. He favoured a portable cam- 
era lucida that once belonged to von Huene 
rather than the modern versions that attach 
to binocular microscopes. 

Seilacher remained at Tübingen for his 
doctorate to study Jurassic and Triassic 
trace fossils (dating from about 252 million 
to 145 million years ago). He spent time 
at the Senckenberg marine station on the 
Baltic Sea, where he learned ‘actuopalaeontology’ 
— using the behaviour of living 
organisms as a guide to interpreting fossils. 
In 1951, on an expedition to the Salt Range 
in Pakistan, Seilacher and his PhD adviser 
Otto Schindewolf discovered trilobite tracks 


in early Cambrian rocks, conveying clues 
about the animals’ lifestyle. In later years, 
Seilacher’s advice was sought by oil compa- 
nies exploring sandstones in North Africa 
and the southwestern United States, because 
he could use trace fossils to interpret the age 
and depositional environment of the rocks. 

Seilacher spent much of his career at 
Tübingen, retiring as a professor in 1990. 
From 1987 to 2009 he spent autumns teaching 
at Yale University in New Haven, Connecti- 
cut. In 1992, he was awarded the Crafoord 
Prize of the Royal Swedish Academy of Sci- 
ences. This enabled him to travel the world, 
often with his wife, Edith, making replicas 
of surfaces that preserve an extraordinary 
range of trace and other fossils and sedimen- 
tary structures such as ripples. This collection 
became his own international touring exhibi- 
tion, entitled Fossil Art. 

Seilacher’s insights fundamentally 
changed the analysis of the evolution of 
form. He broke away from the prevailing 
idea that all morphological features are 
adaptations for some function. He mined 
collections, particularly those of the Pea- 
body Museum of Natural History at Yale, 
for convergent morphologies that provided 
evidence of other influences. No Seilacher 
course on invertebrates was complete without 
his demonstration that a water-filled balloon 
automatically assumes a form similar to 
that of a regular sea urchin, to show that not 
every shape has adaptive significance. 

Seilacher’s ability to interpret morphol- 
ogy from first principles is best demon- 
strated by his research on the strange fossils 
of the Ediacaran period, with shapes such as 
fronds, spindles and discs. He saw in these 
“an exotic principle of organismic construc- 
tion” consisting of quilted chambers that 
maximize the surface available for feeding 
and respiration. He assigned the Ediacara 
organisms to an extinct kingdom of life that 
he called Vendobionta. The relationships of 
these organisms remain controversial, but 
many examples investigated by Seilacher on 
Mistaken Point in Newfoundland, Canada, 
show a fractal organization that defies easy 
assignment to any living group. 

Among the trace fossils, Seilacher was 
particularly fascinated by Paleodictyon, a 
regular, often hexagonal, mesh-like structure 
that is connected to the sediment surface by 
short vertical tunnels, and occurs in rocks 
from as far back as the Cambrian. Seilacher 
interpreted this marine burrow as a ‘farm’ for 
raising bacteria. 

In 1976, modern examples of Paleodictyon 
were discovered in the deep sea by ocean- 
ographer Peter Rona. Almost 30 years later, 
Seilacher and Rona boarded the submersible 
vessel Alvin and retrieved samples from the 
Mid-Atlantic Ridge at a depth of 3.4 kilo- 
metres. Stephen Low’s 2003 film Volcanoes of 
the Deep Sea pictures the two of them slicing 
soft sediments containing the burrow system 
— sadly, the animal proved elusive. 

An inveterate traveller, Dolf taught and 
did field work on every continent except 
Antarctica. He collected not only fossils, but 
also rugs, ancient cylinder seals and pre- 
Columbian art, and loved to share stories 
while enjoying a glass of wine and a cigar. He 
published works on an extraordinary range 
of fossils over his 70-year career, notably his 
2007 text Trace Fossil Analysis (Springer) and 
his forthcoming Morphodynamics (CRC 
Press). 

His iconic drawings and interpretations 
will remain inspirational for teachers and 
students for years to come. 


Derek E. G. Briggs is director of the 
Peabody Museum of Natural History at 
Yale University in New Haven, Connecticut, 
USA. He taught with Dolf Seilacher at Yale 
from 2003 to 2009. 

e-mail: derek.briggs@yale.edu 
Keeping one’s sex 


Progeny of the protist Paramecium tetraurelia always retain the parental mating type. This inheritance is revealed to result 
from an RNA-guided DNA-deletion pathway that protects the genome from foreign DNA sequences. SEE ARTICLE P.447 


DOUGLAS L. CHALKER 


Opposites attract. For the unicellular 
organism Paramecium tetraurelia, 
those opposites are two distinct 
mating types, called even (E) and odd (O). 
Sex between E and O cells occurs by conju- 
gation, a reversible cell fusion during which 
cell partners exchange genetic material before 
re-separating into progeny cells. Although 
these offspring all start development with 
identical, mixed genomes, each cell somehow 
retains the mating type of its parent: E begets 
E, and O begets O. The long-standing mystery 
of this specific inheritance has now been 
solved by a study on page 447 of this issue, 
in which Singh et al. report that small RNA 
molecules communicate mating-type-spe- 
cific genetic information between parent and 
progeny cells. 

Paramecium are ciliates, members of a 
phylum of protists that possess both germline 
and somatic genomes housed in function- 
ally distinct nuclei — the micronucleus and 
macronucleus, respectively. During conjuga- 
tion, the micronuclei undergo meiotic division 
to form gametic nuclei, which are reciprocally 
exchanged between the partner cells. Fusion of 
these gametic nuclei generates two genetically 
identical zygotic genomes within the still-con- 
jugated cells. The cells then separate, the origi- 
nal parental macronuclei are discarded, and 
new somatic and germline genomes are created 
from the zygotic genomes in each cell’s unique 
cytoplasm. Although these new genomes 
represent a combination of the two parental 
genomes, the mating type of the progeny cells 
remains the same as before conjugation (Fig. 1). 

Formation of the new somatic genome 
involves extensive genomic remodelling, 
including the excision of transposable ele- 
ments — foreign DNA sequences, also called 
transposons, that can change position in the 
genome. Also removed are tens of thousands 
of short sequences (some as small as 26 base 
pairs), many of which come from within 
protein-coding regions. To identify the DNA 
to eliminate, a pool of small RNA molecules 
that represents the entire germline genome is 
produced in the micronuclei shortly after con- 
jugating cells pair. These small RNAs are ini- 
tially transported to the parental macronucleus 
Figure 1 | Epigenetic inheritance of mating type. The unicellular organism Paramecium tetraurelia 
exists in two mating types: E cells, which express the cell-surface protein mtA, and O cells, which do not. 
The cells contain a somatic genome housed in a macronucleus and a germline genome in a micronucleus. 
During sexual reproduction, parental E and O cells conjugate and exchange germline genomes, which 
give rise to new, mixed (bi-coloured) germline and somatic genomes after the cells re-separate. During 
this process, a pool of RNA molecules is generated that targets foreign regions of the new genomes for 
deletion, by comparing them with the parental somatic genome. Singh et al. show that this RNA-guided 
DNA-deletion pathway results in mating-type inheritance in this organism: the promoter region of the 
gene mtA is always deleted in the progeny of O cells, but never in the progeny of E cells, because it is 
present only in the E cells’ parental genome. 


before it is discarded, and any RNAs for which 
there are matching sequences in the parental 
somatic genome are removed from the pool. 
(The actual comparison probably occurs 
between meiotic small RNAs and somatic transcripts 
expressed from the parental genome.) 

The remaining small RNAs — which repre- 
sent sequences present in the germline genome 
but absent from the parental somatic genome, 
and which thus primarily correspond to for- 
eign DNA — are bound by molecules called 
Piwi-related proteins and carried to the devel- 
oping somatic genome, where any sequences 
for which there are homologous small RNAs 
are targeted for excision. The end result is a 
reorganized genome that is modelled on that of 
the parent. This RNA-mediated DNA-elimina- 
tion pathway is the ciliates’ evolutionary equiv- 
alent of the Piwi-interacting (pi)RNA pathway 
that silences transposons in the germline cells 
of multicellular organisms. Singh et al. show 
that Paramecium has adapted this pathway to 
regulate its sex determination. 

The authors find that E cells are attractive to 
O cells because the E cells express a cell-surface 
protein called mtA. This protein is expressed 
only in E cells because the promoter sequence 
of the mtA gene behaves like a transposon- 
derived sequence in O-cell somatic nuclei 
and so is targeted for deletion (Fig. 1). Dur- 
ing development, E-cell macronuclei do not 
delete the promoter because that sequence 
exists in the somatic genome of the E par- 
ent. Thus, the presence or absence of the mtA 
promoter is communicated between parent 
and progeny genomes by homologous RNAs, 
which effectively block or promote DNA dele- 
tion, respectively, thereby ensuring that mat- 
ing type is propagated to the next generation. 
This process can be described as an epigenetic 


phenomenon, because different patterns of 
gene activity are inherited even though the 
two cells start with the same DNA sequence. 
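
The logic of the pathway described above can be sketched in Python as a toy model. This is a deliberate simplification, not the authors' method: genomes are represented as lists of named segments, small RNAs as a set of those names, and all segment and function names are hypothetical.

```python
def develop_somatic_genome(germline, parental_somatic):
    """Toy model of RNA-guided DNA deletion.

    1. A pool of small RNAs represents every germline segment.
    2. RNAs with a match in the parental somatic genome are removed.
    3. Segments matching the remaining RNAs are excised from the
       developing somatic genome.
    """
    rna_pool = set(germline)
    remaining_rnas = rna_pool - set(parental_somatic)
    return [seg for seg in germline if seg not in remaining_rnas]


def mating_type(somatic_genome):
    """E cells keep the mtA promoter and express mtA; O cells do not."""
    return "E" if "mtA_promoter" in somatic_genome else "O"


# After conjugation both progeny start from the same zygotic genome.
zygotic = ["gene_x", "transposon", "mtA_promoter", "mtA_coding"]

# The parental somatic genomes differ only in the mtA promoter.
e_parent = ["gene_x", "mtA_promoter", "mtA_coding"]
o_parent = ["gene_x", "mtA_coding"]

e_progeny = develop_somatic_genome(zygotic, e_parent)
o_progeny = develop_somatic_genome(zygotic, o_parent)

print(mating_type(e_progeny), mating_type(o_progeny))
```

In this sketch the transposon is excised in both progeny, whereas the mtA promoter is excised only where the parental somatic genome lacked it, so each progeny cell keeps the mating type of its parent despite starting from an identical zygotic genome.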

Although the DNA-elimination pathway 
is known only in ciliates, RNA-mediated 
gene silencing is not, and several studies have 
indicated that small RNAs commonly func- 
tion to mediate transgenerational inherit- 
ance. For example, in Drosophila fruit flies, 
piRNAs produced maternally from regions 
near the ends of chromosomes can silence 
transposons or other homologous sequences 
that reside elsewhere in the genome. In 
Caenorhabditis elegans worms, three small-RNA 
pathways cooperate to repress experimentally 
introduced genes that were not present or 
expressed in the previous generation, thereby 
serving to identify new foreign sequences in 
the genome. In these cases, the expressed 
genome content of the current generation is 
communicated to the next by comparison of 
small RNAs with available mRNA transcripts, 
which are produced from protein-coding 
regions of the genome. In Paramecium, how- 
ever, Singh et al. show that both coding and 
non-coding sequences can be compared, 
because full-genome complements of both ger- 
mline small RNAs and somatic transcripts are 
generated during the cells’ development. 

The piRNA pathways have largely been 
regarded as a means of distinguishing self 
from non-self and thereby silencing foreign 
genome invaders (such as transposable ele- 
ments). But Singh and colleagues have shown 
that RNA-mediated genome-defence pathways 
can be exploited to evolve modes of epigenetic 
regulation. The co-option of this pathway to 
regulate genes may be more frequent than this 
one case reveals. When the authors examined 
related Paramecium species that also display 
epigenetic inheritance of mating types, they 
found that Paramecium septaurelia does not 
differentially excise the mtA promoter in dif- 
ferent mating types but, instead, O cells delete 
a block of sequence from another gene, mtB, 
which encodes a transcription-regulating 
protein required for mtA expression. The exci- 
sion creates a non-functional version of this 
gene and so mtA is not expressed. E cells of 
this species, however, retain this sequence and 
express a functional mtB protein and therefore 
also mtA. 

Thus, even for this one trait, the DNA- 
excision pathway has been adapted twice to 
epigenetically regulate two different genes. 
It has taken nearly 75 years to uncover how 
mating type is inherited during sexual repro- 
duction in Paramecium. We can hope that this 
discovery will expedite the identification of 
other examples of cell-intrinsic gene regulation 
mediated by homologous RNAs. 


Douglas L. Chalker is in the Department of 
Biology, Washington University in St. Louis, 
St Louis, Missouri 63130, USA. 

e-mail: dchalker@wustl.edu 
Windy stars that 
go with a bang 


The probable signature of an ageing massive star’s stellar wind has been detected 
shortly after the star underwent a supernova explosion. The finding suggests that 
such windy stars can have bright, observable deaths. SEE LETTER P.471 


JOHN J. ELDRIDGE 


The most direct indication yet found that 
nearly all massive stars end their lives 
in supernovae is described by Gal-Yam 
et al. on page 471 of this issue. The authors 
report the detection of emission lines in a spec- 
trum taken just 15.5 hours after a star exploded 
as a supernova, uncovering the nature of the 
progenitor star. The spectral features closely 
resemble those expected for Wolf-Rayet stars 
— ageing massive stars that shed material rap- 
idly in strong stellar winds (Fig. 1). Until this 
event, there was growing evidence that such 


stars were likely to have dim or unobservable 
deaths. 

Supernovae are classified by their observed 
characteristics. For example, those with hydro- 
gen lines in their spectra fall into the type II 
category, and those without them are classi- 
fied as type I. This spectral signature provides 
information about the composition of the star 
that exploded, and the nature of the explo- 
sion. The presence of silicon lines indicates a 
type Ia supernova produced by a dense white 
dwarf star that underwent a thermonuclear 
explosion. 

The more common events, known as core- 


Figure 1 | An ageing massive star. Wolf-Rayet stars such as the one shown here (central bright object) 
are evolved, massive stars that continually lose gas in dense stellar winds (orange and yellow glow). 
Gal-Yam et al. identified Wolf-Rayet-like wind signatures in a spectrum taken 15.5 hours after a star 
exploded as a supernova, before the expanding supernova ejecta could sweep up the wind. 
collapse supernovae, include types II, Ib and Ic. 
They do not have silicon lines in their spectra, 
and result from the collapse of the core of a 
massive star. The temperatures at the centre of 
such massive stars are high enough for nuclear 
fusion in their cores to progress from hydro- 
gen all the way to a mixture of iron, cobalt 
and nickel. But fusion of these elements is not 
energetically favourable, and so the core col- 
lapses to a neutron star — or to a black hole 
in the case of the most massive stars. The 
gravitational energy released by the collapse 
is transferred to the material surrounding the 
core, causing the star to explode. 

The only events for which the progenitor 
stars are definitely known are the type II supernovae: 
they are mainly red supergiant stars, 
which are rich in hydrogen and have cool sur- 
face temperatures. Many of these stars have 
been directly identified in images fortuitously 
taken before the stars exploded. Their masses 
fall into a narrow range, from 8 to 20 solar 
masses. There are also a few examples of rare 
and extreme supernovae that may have had as 
progenitors luminous blue variable stars, which 
are thought to be around 100 solar masses. But 
this is difficult to confirm, because no spectra 
of such progenitor stars have been obtained. 

According to theoretical models of stel- 
lar evolution, Wolf-Rayet stars, which are 
greater than 20 solar masses, should end their 
lives in core-collapse supernovae and have 
strong stellar winds throughout their lifetime. 
The latter property means that these stars 
will lose all, or nearly all, of their hydrogen 
by the time they explode. The low hydrogen 
abundances make the stars extremely lumi- 
nous and hot, which in turn causes the dense 
stellar wind to be driven from the star’s sur- 
face at a speed of at least 1,000 kilometres per 
second. The signature of the wind is imprinted 
on the spectrum of the star in the form of 
broad emission lines, with the widths of the 
lines revealing the wind speed. 
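
The link between line width and wind speed follows from the Doppler effect. The following is a minimal sketch, assuming the non-relativistic relation v ≈ c·Δλ/λ and treating the half-width of an emission line as the shift produced by gas moving at the wind speed. The rest wavelength and half-width below are illustrative numbers, not values from Gal-Yam and colleagues' spectrum, and the function name is hypothetical.

```python
C_KM_S = 299_792.458  # speed of light in kilometres per second


def wind_speed_km_s(rest_wavelength, half_width):
    """Estimate wind speed from the half-width of a broadened line.

    Both arguments must share the same unit (for example angstroms).
    Uses the non-relativistic Doppler relation v = c * dlambda / lambda.
    """
    return C_KM_S * half_width / rest_wavelength


# Illustrative only: a line near 4,686 angstroms broadened by 15.6 angstroms
# on each side of its centre.
speed = wind_speed_km_s(rest_wavelength=4686.0, half_width=15.6)
print(f"estimated wind speed: {speed:.0f} km/s")
```

With these illustrative inputs the estimate lands close to the 1,000 kilometres per second mentioned above; a broader line would imply a proportionally faster wind.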

However, until Gal-Yam and colleagues’ 
study, there was no direct evidence that Wolf- 
Rayet stars do explode in supernovae. Statisti- 
cal considerations indicate that pre-explosion 
images of a sample of 12 nearby supernovae 
that are poor in or devoid of hydrogen should 
have revealed at least one, but none has been 
found. Also, last year’s detection of a supernova 
progenitor that seemed to have properties 
consistent with those of Wolf-Rayet stars 
is now thought not to be associated with such 
stars. Their fate has therefore remained uncertain, 
and it has been suggested that they end 
their lives in weak or invisible explosions. 

By identifying Wolf-Rayet-like wind signatures 
in a spectrum of a supernova obtained 
less than one day after the star exploded, 
Gal-Yam and colleagues show that at least one 
Wolf-Rayet star may have ended its life in a 
visible supernova. Key to the authors’ finding 
was both the ability to obtain a spectrum so 
quickly after the explosion, before the stellar 
wind was swept up by the explosion, and the 
fact that Wolf-Rayet stars have winds that 
extend great distances from the star. Detect- 
ing Wolf-Rayet-like spectral features in an 
early-supernova spectrum is by far the most 
direct evidence that these stars have bright and 
observable deaths. The only way to strengthen 
this conclusion would be to obtain a spectrum 
of the star before it exploded. 

But Gal-Yam and colleagues’ results go 
beyond simply identifying the type of star that 
exploded. The authors have also been able to 
estimate the speed and density of the stellar 
wind. If similar observations are made for 
other ageing windy stars, then we will begin to 
understand the range of possible stellar-wind 
properties that such stars could have shortly 
before they die, and to learn whether these 
death throes are very different and special, or 
only as expected. Either way, they will provide 
important clues about how the most massive 
stars end their lives. 
Double function at the 
blood-brain barrier 


Two aspects of the blood-brain barrier — the transport of lipids to the brain and 
the transport of molecules across cells lining blood vessels — have been shown to 
be regulated by the same protein, Mfsd2a. SEE LETTERS P.503 & P.507 


CHRISTER BETSHOLTZ 


The blood-brain barrier (BBB) is a 
double-edged sword. On the one hand, 
this cellular interface helps to maintain 
a constant, optimal environment for neuronal 
function through a combination of barriers 
and selective transport systems that regulate 
the passage of wanted and unwanted mol- 
ecules. But on the other hand, it presents a 
formidable challenge to medicine because it 
stops most drugs from passing from the blood- 
stream to the brain. Two papers in this issue 
add considerably to our rudimentary understanding 
of the BBB: Nguyen et al. (page 503) 
unravel how an essential omega-3 fatty acid 
is transported across it, and Ben-Zvi et al. 
(page 507) identify a mechanism involved in 
suppressing the vesicle-mediated transfer of 
blood-plasma constituents to the brain. 
Astonishingly, these two seemingly 
unrelated processes depend on the same gene, 
Mfsd2a, which encodes a transmembrane pro- 
tein that is specific to the endothelial cells that 
line blood vessels in the brain. Two birds killed 
with one stone, it seems. But the discoveries 
also illustrate the contradictory properties 
of the BBB: Mfsd2a transports lipids that are 
essential for brain growth and function, while 
at the same time suppressing a transport route 
across the BBB that might be ideal for the 
delivery of complex drugs, such as antibodies, 
into the brain. 

Mfsd2a is a member of the major facilitator 
superfamily (MFS) of secondary-active 
transporters — proteins that use the electrochemical 
potential of solutes to shuttle 
specific substrates across lipid membranes. 
However, many MFS transporters do not yet 
have described substrates and functions. 
Previous work has shown that Mfsd2a expression 
is induced by factors in the liver during fasting, 
and the protein has also been implicated 
in antibiotic transport and in cell fusion in 
the human placenta. However, Mfsd2a is 
strongly evolutionarily conserved, which 
seems inconsistent with a human-specific 
role in the placenta, and instead suggests that 
its primary function lies elsewhere. Moreover, 
Mfsd2a is found at only low levels in the liver, 
but it is highly expressed throughout the brains 
of several species, and mice lacking Mfsd2a 
have normal liver metabolism but develop 
neurological disorders. Now, both Nguyen 
et al. and Ben-Zvi et al. determine that Mfsd2a 
is specifically and constitutively expressed by 


Figure 1 | Dual role for Mfsd2a. Lysophosphatidylcholine-docosahexaenoic acid (LPC-DHA) is bound 
to the protein albumin during transport in the blood. When it reaches the blood-brain barrier, LPC-DHA 
detaches from albumin and is adsorbed to the outer lipid leaflet of the cell membrane of endothelial cells. 
Nguyen et al. show that Mfsd2a binds LPC-DHA and transfers it to the inner lipid leaflet, potentially 
allowing it to bypass the tight junctions between endothelial cells and reach the brain-facing side of the 
cell. The targets of DHA in the brain are unknown, as is the method by which it passes through the 
‘end-feet’ structures of astrocyte cells, which completely cover the brain's blood vessels. Ben-Zvi 
et al. also show that Mfsd2a suppresses transcytosis in endothelial cells, a transport mechanism 
by which components of the blood plasma, including proteins, are transferred across the cell in vesicles. 


brain endothelial cells, suggesting that it has a 
role in the BBB. 

Nguyen et al. extended the anatomical and 
behavioural characterization of mice lacking 
Mfsd2a, finding that these animals have small 
brains and a range of motor and cognitive 
defects, and have reduced numbers of certain 
neuron types. Noting that these defects were 
reminiscent of omega-3 fatty-acid deficiency, 
the authors compared the lipid composition 
of brains from Mfsd2a-knockout and control 
mice and found reduced levels of a certain 
omega-3 fatty acid, docosahexaenoic acid 
(DHA), in the mutant animals. In vitro analysis 
revealed that Mfsd2a transports DHA only 
when this fatty acid is attached to the lipid 
lysophosphatidylcholine (LPC). In vivo experi- 
ments confirmed that the brain’s uptake of 
DHA occurs mainly through Mfsd2a-depend- 
ent transport of LPC-DHA (Fig. 1). Although 
the importance of DHA for normal brain 
growth and function was already known, 
the role of LPC as its carrier across the BBB, by 
means of Mfsd2a, represents a breakthrough in 
our understanding of how essential fatty acids 
enter the brain. 

Ben-Zvi and colleagues approached Mfsd2a 
from a different angle. They searched for 
gene transcripts whose expression by vascu- 
lar cells correlated with the development of 
the BBB. Previous work had indicated that, 
in mammals, the BBB forms, at least in part, 


before birth, but it was not clear exactly when 
this occurs and whether there are regional 
differences. To observe the development of the 
BBB, the authors injected fluorescent tracer 
dyes into the livers of fetal mice, from which 
the dyes entered the systemic circulation. The 
authors were then able to map the maturation of 
the embryonic BBB with high spatio-temporal 
precision and thus to select a suitable develop- 
ment stage for transcript profiling. They found 
that Mfsd2a was prominently overexpressed in 
the brain-cortex endothelium compared with 
the lung vascular endothelium at embryonic 
day 13.5, and that the BBB in Mfsd2a-knockout 
mice was impaired from embryonic day 15.5 
to adulthood. 

The authors also found that, compared to 
control mice, the brain endothelial cells of 
Mfsd2a-knockout mice displayed increased 
transcytosis — the transfer of molecules from 
one side of a cell to another in membrane- 
bound vesicles (Fig. 1). Increased transcytosis 
has also been reported for mice with a reduced 
density of pericytes (cells that wrap around the 
endothelial cells of small blood vessels) in the 
brain, which led Ben-Zvi et al. to the finding 
that the expression of Mfsd2a in brain endothe- 
lial cells depends on the presence of pericytes. 
This transcytotic route is particularly attractive 
from a drug-delivery perspective because there 
is no evidence for restrictions on the molecu- 
lar mass or physico-chemical properties of the 
cargo transported by this process. Targeting 
pericytes as a way of opening the BBB for drug 
passage would be illogical (because these cells 
lie on the brain side rather than the blood side 
of the barrier), but targeting Mfsd2a in the 
endothelial-cell luminal membrane (which 
contacts the blood) could be feasible. 

These two studies provide the first molecular 
handle on lipid and membrane-vesicle trans- 
port across the brain endothelium, but there 
are still several details to be explored. How 
does Mfsd2a regulate endothelial transcytosis? 
Is it a direct mechanism, or does it act indi- 
rectly through deficient lipid transport to the 
brain? Other causes of lipid deficiency, such 
as a dietary lack of DHA, might shed light on 
this question. 

Regarding the transport of DHA across the 
BBB, only the first step has been elucidated so 
far, and it will be interesting to explore how 
the lipid (or lipids) transported by Mfsd2a 
completes its passage across the multicel- 
lular BBB and into the brain. Nguyen and 
colleagues’ findings suggest that Mfsd2a trans- 
locates LPC-DHA from the outer leaflet of 
the endothelial-cell luminal membrane to the 
inner leaflet. Lipids in the inner leaflet, but not 
the outer leaflet, can bypass the tight junctions 
between cells, which may allow diffusion 
of LPC-DHA from the luminal to the ablumi- 
nal membrane, which faces the brain, but how 
further transport of DHA in the brain occurs 
is unclear. The functions of DHA in the brain 
are also not known, although possibilities 
include structural roles in membranes or 
signalling roles in regulating cell behaviour. 
Further studies of Mfsd2a-knockout mice 
should allow these and other questions to be 
addressed. 
MATERIALS CHEMISTRY 


Selectivity from 
flexibility 


Porous materials called metal-organic frameworks hold promise for many 
applications, including molecular separations. One such material has been 
discovered that shape-shifts to amplify its selectivity for a target molecule. 


RYOTARO MATSUDA 


One of the most common preconceptions 
about crystals is that they are 
rigid. However, this is not necessarily 
the case for crystals of metal-organic 
frameworks (MOFs) — crystalline materi- 
als in which metal ions are linked by various 
organic molecules. Some MOF crystals can 
flexibly change their crystalline structures 
when they trap or release molecules. Writing 
in Angewandte Chemie, Warren et al. describe 
a flexible MOF that reconstructs its molecular 
framework to selectively recognize different 
isomers of xylene, an industrially useful com- 
pound. The finding provides much-needed 
insight into how flexible MOFs should be 
designed to maximize their selectivity for 
target molecules. 

MOF crystals contain thousands of pores,
in most cases of less than 2 nanometres in 
diameter. These nanoporous materials are 
therefore being extensively studied for potential
industrial applications such as gas storage,
molecular separation and catalysis, along with
other porous materials, for example zeolites 
and activated carbons. At the dawn of MOF 
chemistry in the 1990s, the primary target was 
to fabricate rigid, robust and hard crystalline 
MOFs to achieve highly porous, stable networks.
However, in the 2000s, flexible MOFs
emerged that could undergo reversible struc- 
tural changes without collapsing. 

The flexible nature of these MOFs mainly 
derives from the ‘coordination’ bonds that 
form between the metal ions and the organic 
linkers, and the large volume of free space 
within MOFs. Because coordination bonds are 
rather weak, bond-breaking and re-formation 
can occur easily in MOF crystals, and this
may, in turn, allow dynamic rearrangement of 
the bonding geometry. And because the infi- 
nite molecular network of MOFs is formed 
entirely by coordination bonds, such shifts 
enable the structural transformation of an 
entire crystal. 

Figure 1 | Pore remodelling enhances molecular recognition. Warren et al. report a metal-organic
framework (MOF) containing one-dimensional channels that change shape to bind the para- and meta-isomers
of xylene. a, Channel restructuring (red and blue arrows) allows an excellent fit to para-xylene
molecules. b, Such an efficient fit is not possible for meta-xylene, and generates unused voids between
guest molecules. This difference greatly enhances the MOF’s selectivity for para-xylene.

The dynamic motion of the organic molecules
in MOF crystals also has a significant role
in conferring structural flexibility. Molecules
that form part of the porous skeleton have less 
contact with other components of the MOF 
than molecules found in most other solids, 
allowing highly flexible rotational and vibra- 
tional movements that enable modulation or 
tuning of the pore structure. 

Initially, flexible transformations were 
observed when solvent was removed from 
or adsorbed to MOFs. Subsequently, the
phenomenon was also observed to occur
when MOFs trapped gas molecules in a dry
atmosphere, which ruled out the possibility
that the changes were simply the result of
the framework recrystallizing. Investigations
of the molecular-trapping mechanism in flexible
MOFs have advanced considerably as a
result of direct observations of framework
changes, and of trapped molecules in the nanopores,
using X-ray structural analysis.

In general, these flexible systems open their 
pores to accommodate guest molecules, then 
return to a closed state once the guests have 
been discharged. However, a complication 
for molecular-separation applications is that, 
once the pores have been opened by the target 
molecules in a mixture, other unwanted com- 
ponents of the mixture can also diffuse into 
the pores, provided that they can fit into the 
available void space within the open-pore form 
of the framework. Crucially, this can diminish 
the selectivity of the MOF towards the desired 
target, and thus reduce its overall utility for 
separation. It is therefore necessary to establish
guidelines for designing flexible MOFs
with superior separation abilities to those of
existing ones.

Enter Warren et al., who describe a MOF 
that takes up the para-isomer of xylene highly 
selectively over the meta-isomer, and that 
separates para-xylene from its other isomers 
more effectively than materials currently in 
use (inorganic zeolites). Xylene is obtained as 
a mixture of isomers from a catalytic process, 
but para-xylene is the most useful, because it is 
a feedstock for the production of terephthalic 
acid — which, in turn, is needed to manufac- 
ture polyethylene terephthalate, a polymer 
commonly used to make plastic bottles. Being 
able to obtain pure para-xylene is therefore 
important; this is not easy, however, because its 
size and physical properties (such as its boiling 
point) are similar to those of the other isomers. 

Not only do Warren and co-workers report 
a high-performance separation of para- and 
meta-xylene, but they also provide clear 
crystallographic evidence of the mechanism 
involved, thus supplying crucial information 
about how to design flexible MOFs. Their 
MOF has two types of one-dimensional chan- 
nel. Notably, the channels can undergo struc- 
tural relaxation involving geometric changes 
around the metal (cerium) ions and rotation 
of a benzene ring in the organic linkers.

When the authors modelled the guest-free
framework computationally as a rigid
structure, the simulation predicted that meta-xylene
should be excluded from the pores. But,
in fact, reconstruction of the framework causes 
channel relaxation and can allow meta-xylene 
to be accommodated, as proved by X-ray crys- 
tallographic analysis. So why does the MOF 
absorb para-xylene selectively? 

Warren et al. performed further detailed 
structural studies and found that channel 
restructuring that results from para-xylene 
accommodation occurs synergistically with 
positive feedback, which enhances the fit- 
ting of the channels to the molecules through 
localized expansion and contraction (Fig. 1a). 
Conversely, accommodation of meta-xylene 
requires a large distortion of the channels and 
occurs with negative feedback, which generates 
unfavourable unused void space and disordered 
guest molecules (Fig. 1b). The researchers 
suggest that the favourable and unfavourable 
channel restructuring for para-xylene and 
meta-xylene, respectively, amplifies the MOF’s 
selectivity in separations of the two isomers. 

Although Warren and colleagues clearly
show how the MOF accommodates pure para-xylene
or meta-xylene, it is not clear how the
channels trap a mixture of both substrates 
during separation. This should be studied in 
future work. Furthermore, the authors studied 
their system mostly at equilibrium states, but 
kinetic studies of the separation will provide 
more practical information for the design of 
flexible MOFs for different applications. 

The conformation of biomolecules such 
as proteins can be flexibly transformed to 
enhance their ability to recognize a specific 
substrate. The new findings prove that this 
concept also applies to flexible MOFs, and 
open up a field of molecular recognition and 
separation by crystalline porous solids. Once 
guidelines have been established for how to 
design frameworks to enhance selectivity 
for a wide range of target molecules, flexible 
MOFs will be highly promising materials for 
molecular separation in industry.
Darwinian tumour suppression


Competition for access to a survival factor has been found to explain why 
incoming cells from the bone marrow replace resident cells in the thymus. 
Reducing this competition can cause tumours to form. SEE ARTICLE P.465 


EDUARDO MORENO 


The idea that cells in the body compete in
a Darwinian manner — that is, accord- 
ing to the laws of natural selection — 
has been postulated several times. First, in 
1881, zoologist Wilhelm Roux anticipated that 
cells and ‘parts’ of an organism must fight with 
each other to form the body during embryonic 
development. A few years later, neuroscientist 
Santiago Ramón y Cajal predicted that neurons
undergo a competitive struggle for space
and nutrition. This suggestion gained support 
with neurologist Rita Levi-Montalcini’s discov- 
ery of growth factors, and their integration into 
the ‘neurotrophic theory’, which proposed that
neurons compete for limiting amounts of sur- 
vival-promoting factors, a process that poten- 
tially eliminates unfit cells. On page 465 of this 
issue, Martins et al. describe another example
in which Darwinian competition between
cells occurs naturally — in this case, serving 
as a potent tumour-suppressor mechanism 
in the thymus. 
The thymus is a specialized organ of the 
immune system. Located in the thoracic cavity,
its function is to host a type of immature
white blood cell, the T cell, and to develop
such cells into mature T cells that are capable
of recognizing harmful foreign substances.
The normal function of the thymus depends
on a continuous supply of cells from the bone
marrow. When these bone-marrow-derived
progenitors reach the thymus, they replace
thymus-resident progenitors (Fig. 1a).

Martins and colleagues were inspired by 
previous research in Drosophila fruit flies 
showing that slowly proliferating cells are rec- 
ognized and eliminated by cells proliferating 
at a normal rate, through a mechanism pro- 
posed to involve competition for extracellular 
factors. In a distinctive case of the ‘random
walks’ by which science moves forward, those
researchers were themselves motivated by the
trophic theories discussed above: that Darwinian-like
competition among cells for factors
required for survival leads to elimination of 
a fraction of the cell population. 



Figure 1 | Cellular competition prevents tumour formation. a, T cells develop in the thymus from 
progenitor cells. The thymus is continuously supplied with progenitor cells from the bone marrow, which 
replace thymus-resident progenitor cells. Martins et al. show that competition between these cells for the
survival factor IL-7 (not shown) leads to the death of the thymus-resident progenitors. b, The authors also 
show that an absence of competition, when the supply of bone-marrow progenitors is blocked, leads to 
uncontrolled self-renewal of thymus-resident progenitors and T-cell development, such that tumours form. 
Martins et al. resolved to analyse whether,
in mice, the normal replacement of thymus- 
resident progenitors by bone-marrow-derived 
colonizing cells shows the hallmarks of this 
type of competition. They found that compe- 
tition for the blood-cell survival factor inter- 
leukin-7 (IL-7) could explain this replacement. 
IL-7 can activate the expression of an intra- 
cellular pro-survival protein, Bcl2 (ref. 5). The 
authors propose that IL-7 availability is limited 
for thymus-resident progenitors and that, in the 
presence of bone-marrow-derived progenitors, 
this leads to lower levels of Bcl2 in thymus- 
resident cells and, therefore, their death. 

The next (and perhaps the most intriguing) 
question the researchers asked was: what would 
be the consequences of a lack of competition in
this organ? In other words, what would happen 
in the absence of incoming bone-marrow pro- 
genitors? It was known that in the absence of 
incoming cells, the thymus-resident progenitors
can self-renew and produce T cells, compensating
for this loss. But the surprise came
when Martins and colleagues observed that 
the lack of competition from colonizing bone- 
marrow cells resulted in the formation of 
tumours (Fig. 1b), owing to the genetic trans- 
formation of the thymus-resident progenitors. 
Interestingly, the transformation of the thymus- 
resident progenitors produced a tumour type 
that resembled human T-cell acute lympho- 
blastic leukaemia in many aspects, including 
the type of genomic changes, the cells’ gene- 
transcription profiles and the presence of 
activating mutations in the gene Notch1. The 
results suggest that cell competition is required 
to periodically replace thymus-resident pro- 
genitors with fresh bone-marrow-derived pro- 
genitors, and that if this process is interrupted 
the thymic cells become cancerous. 

Thus, this study describes an exciting 
Darwinian mechanism that functions to pre- 
vent cancer in the thymus. But there are sev- 
eral aspects to this process that warrant further 
investigation. For example, it seems from the 
authors’ findings that cells in the thymus have 
a higher propensity to develop tumours than 
cells in other organs, but why might this be the 
case? Is the environment of the thymus cancer- 
promoting — do progenitors suffer cellular or 
genetic insults while performing their tasks 
in the thymus? Or do cells moving from the 
bone marrow to the thymus come with a pre- 
programmed ‘expiry date’ and become malig- 
nant after this? 

It is also apparent from work in Drosophila 
that trophic theories are inadequate to explain 
cell competition. In flies, the amount of sur- 
vival factor for which cells compete does not 
need to be limiting, because there is a fasci- 
nating mechanism — based on ‘fitness finger- 
prints’ displayed on cell membranes — that 
allows cells to compare their fitness directly.
If similar systems enable mammalian cells to 
exchange fitness information, thymic pro- 
genitors might be able to detect and eliminate 
less-efficient cells even if there is no scarcity of
the molecules they compete for. One testable 
prediction of this model is that progenitors 
with impaired IL-7 signalling will modify their 
fitness fingerprints accordingly, and actively 
reveal their fitness to neighbouring cells. 

What are the consequences of Martins 
and colleagues’ discovery for cancer treat- 
ment? T-cell acute lymphoblastic leukaemia 
is an aggressive cancer that frequently shows 
more-pronounced resistance to chemotherapy 
than does the more common form of blood 
cancer, B-cell lymphoma. Targeting the genes 
involved in cell competition might provide 
an avenue for treating and diagnosing T-cell 
leukaemia. 

This work may also have immediate con- 
sequences for gene-therapy treatments in 
humans, because the scenario described by 
Martins et al. in mice also occurs in patients 
receiving gene therapy for severe combined 
immunodeficiency — in a subset of patients, 
the genetically modified precursor cells 
that are administered generate T cells in the 
absence of bone-marrow-derived precursors.
It now seems that this lack of competition
may predispose those patients to tumour
formation. Despite these open questions and
the various directions in which this research
may develop, it is clear that exciting days
lie ahead for the study of cell competition in
relation to cancer.


EARTH SCIENCE


Seismicity


An analysis of crustal uplift around California’s San Joaquin Valley, caused by 
groundwater extraction, reveals that such removal leads to both seasonal and 
long-term unclamping of the nearby San Andreas Fault system. SEE LETTER P.483 


PAUL LUNDGREN 


Earthquakes are one of the most devastating
manifestations of plate tectonics. Plate
boundaries, such as the one between the
Pacific and North American plates in California,
are broken into multiple blocks separated
by faults of varying sizes. Our understanding
of earthquake size and occurrence is generally
considered from the perspective of a fault’s
geological history, along with its present-day
geodetic signature of surface deformation
related to coupling of the fault at depth.
Earthquakes cause changes in stress along
and across fault systems that either promote
or deter seismicity. Seismicity and
fault stresses are considered largely on the
basis of plate-boundary mechanics, without
taking into account processes at Earth's
surface, although the past decade has seen
an increasing recognition that loading at the
surface may be an important modulator of
seismicity.

In this issue, Amos et al. (page 483) report
an analysis of crustal uplift surrounding 
California's San Joaquin Valley that has resulted 
from water extraction. The study demonstrates 
that such anthropogenic effects can cause sig- 
nificant unclamping of the nearby San Andreas 
Fault system through flexure of the lithosphere 
(Earth’s crust and uppermost mantle). More- 
over, there is both a seasonal variation in and 
long-term promotion of seismicity associ- 
ated with the water extraction. The latter may 
hasten the occurrence of future large earth- 
quakes in the San Andreas Fault system (Fig. 1), 
although this study does not specify that one 
is imminent. 

The San Joaquin Valley forms the southern 
half of California’s large Central Valley, and 
lies between the Sierra Nevada mountains to 
the east and the Coast Ranges to the west. The 
valley is a significant source of a wide variety 
of crops, but with little precipitation it relies 
heavily on irrigation and thus has been subjected
to long-term water withdrawal since
the late 1800s. This extraction of
water exceeds its replenishment 
and results in considerable ground 
subsidence that is visibly evident 
and that has been measured geo- 
detically within the valley itself. 
With the advent of the dense net- 
works of continuous Global Posi- 
tioning System (GPS) stations in 
the western United States, highly 
precise (less than 1 millimetre per 
year) vertical GPS velocities can 
now be obtained.

In their study, Amos et al. ana- 
lyse vertical GPS velocities across 
the San Joaquin Valley and find 
that a ring of crustal uplift sur- 
rounds the largely subsiding val- 
ley floor. By examining variations 
in the GPS time series, the authors 
isolate annual seasonal variations 
from each GPS site’s multi-year 
trend. The seasonal variations 
relate to changes in precipitation 
mass in both the surrounding 
mountains and the San Joaquin 
Valley, whereas the multi-year 
trend reflects long-term water 
extraction in the valley. The result 
is a profile of crustal uplift from 
the Pacific coast across the Sierra 
Nevada that is consistent with 
a model for flexure of the litho- 
sphere that is caused by changes 
in hydrologic loading across the 
valley. The authors fit this model 
to the observed vertical GPS 
velocities and estimate changes in 
the stresses (sliding and clamping) 
that drive faults adjacent to the 
valley, including the San Andreas 
Fault. 
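
As a rough illustration of the separation step described above, the sketch below fits a straight line plus an annual sinusoid to a vertical position series by ordinary least squares. It is not the authors' processing chain: the function name, the synthetic data and the four-parameter model (offset, rate, annual cosine and sine terms) are assumptions chosen only to show how a multi-year trend and a seasonal amplitude can be estimated together.

```python
import math


def fit_trend_and_annual(times_yr, heights_mm):
    """Least-squares fit of h(t) = a + b*t + c*cos(2*pi*t) + d*sin(2*pi*t).

    times_yr are in years, heights_mm in millimetres. The model is an
    assumed, simplified one: a constant offset, a linear rate and a single
    annual cycle.
    """
    rows = [[1.0, t, math.cos(2 * math.pi * t), math.sin(2 * math.pi * t)]
            for t in times_yr]
    n = 4
    # Build the normal equations (A^T A) x = A^T y as an augmented matrix.
    aug = [[sum(r[i] * r[j] for r in rows) for j in range(n)]
           + [sum(r[i] * y for r, y in zip(rows, heights_mm))]
           for i in range(n)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [x - factor * p for x, p in zip(aug[r], aug[col])]
    offset, rate, c, d = (aug[i][n] / aug[i][i] for i in range(n))
    return {
        "offset_mm": offset,
        "rate_mm_per_yr": rate,                  # multi-year trend
        "annual_amplitude_mm": math.hypot(c, d),  # size of the seasonal cycle
    }


# Synthetic, noise-free weekly series over eight years (hypothetical values):
# 1.5 mm/yr of uplift plus a 3 mm annual oscillation.
times = [i / 52 for i in range(52 * 8)]
heights = [0.2 + 1.5 * t + 3.0 * math.sin(2 * math.pi * t) for t in times]
result = fit_trend_and_annual(times, heights)
```

On this synthetic series the fit recovers the assumed rate and seasonal amplitude; real GPS records would also carry noise, offsets and gaps that this sketch ignores.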

Amos et al. find that the flexure serves to 
unclamp and increase the sliding on the San 
Andreas Fault system. The unclamping of 
vertical faults such as the San Andreas occurs 
over the entire seismogenic portion of the fault 
(down to a depth of roughly 15 kilometres). 
Interestingly, the seasonal loading caused by 
water recharge in the San Joaquin Valley, and 
precipitation loading and unloading in the 
adjacent mountains, produces a Coulomb 
stress change (fault-driving stress) of around 
1 kilopascal — a value that is of the same order 
of magnitude as the loading of faults due to 
nearby large earthquakes, and much greater 
than the long-term annual unloading sig- 
nal from groundwater withdrawal. Over the 
long term (a 100-year timescale), the unload- 
ing generated by constant water extraction 
exceeds the seasonal effect and enhances the 
accumulation of stress between the occurrence 
of large earthquakes, bringing faults such as 
San Andreas closer to failure. 
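
The Coulomb stress change mentioned above is commonly written as the shear-stress change in the slip direction plus an effective friction coefficient times the normal-stress change, with unclamping counted as positive. The snippet below is a minimal sketch of that bookkeeping. The friction value and the input stresses are hypothetical numbers picked so that the total lands near the 1-kilopascal scale quoted in the text; they are not taken from Amos and colleagues' study.

```python
def coulomb_stress_change(d_shear_pa, d_normal_pa, effective_friction=0.4):
    """Return the Coulomb stress change in pascals.

    d_shear_pa: shear-stress change resolved in the slip direction (Pa).
    d_normal_pa: normal-stress change, positive when the fault is unclamped (Pa).
    effective_friction: assumed effective friction coefficient (dimensionless).
    """
    return d_shear_pa + effective_friction * d_normal_pa


# Hypothetical seasonal values, for illustration only.
seasonal = coulomb_stress_change(d_shear_pa=400.0, d_normal_pa=1500.0)

# Pure unclamping with no shear change still moves the fault towards failure.
unclamp_only = coulomb_stress_change(d_shear_pa=0.0, d_normal_pa=1500.0)
```

A positive result means the fault is pushed towards failure and a negative one means it is stabilized, which is why unclamping alone can promote seismicity. As the article notes later, the quantity says nothing about the absolute stress already on the fault.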

Figure 1 | Aerial view of the San Andreas Fault along the Carrizo Plain,
central California. Amos and colleagues’ study suggests that seasonal and
decadal water extraction in the nearby southern Central Valley stresses the
San Andreas Fault system, causing variations in seismicity and promoting
future earthquakes.

Amos and colleagues’ analysis shows
how anthropogenic effects, especially on a
large spatial scale, can affect seismicity rates
and potentially long-term earthquake haz- 
ard. Their analysis of vertical GPS velocities 
on the periphery of the San Joaquin Valley 
also shows that crustal uplift in the adja- 
cent southern Sierra Nevada can be at least 
partly explained by groundwater withdrawal. 
Such uplift has previously been attributed to 
changes in glacial erosion during the Pleis- 
tocene geological epoch (between about 
2,588,000 and 11,700 years ago), or to delami- 
nation (peeling away) of the Sierra Nevada 
lithosphere.

The ability to perform analyses such as that 
of Amos and colleagues relies on the exist- 
ence of a dense, high-quality, continuous 
GPS network maintained over decades, cou- 
pled with an understanding of groundwater 
extraction. California’s Central Valley is rela- 
tively large, with a simple geometry and faults 
that run parallel to the valley, thus allowing 
a quite simple interpretation of the crustal- 
uplift data and their effect on seismicity. The 
amount of scatter in the vertical GPS velocities 
requires that roughly 200 km of crustal-uplift 
data be condensed into a single
profile across the axis of the 
valley, ignoring second-order 
effects due to three-dimensional 
heterogeneity of the withdrawal 
pattern and fault-system geom- 
etry. The authors’ flexure model 
fits the GPS velocities to within the 
data scatter, but does not use more 
realistic lithospheric properties, 
such as the viscous response of the 
lithosphere to temporally varying 
surface loading, that might allow 
better resolution of the fault-stress 
changes. 

Furthermore, Coulomb stress 
change shows only those faults 
promoted to failure; it gives no 
information on the current state 
of stress on a fault or the physics 
of rupture initiation and dynamics.
Therefore, it is not a quantitative
assessment of seismic
hazard. As the number of dense,
high-quality geodetic observations
increases across
the world, more analyses of this
kind will be possible in regions
of active tectonics and where
there is large-scale anthropogenic
interaction with the near-surface
environment. This
will be useful because the role of 
groundwater withdrawal in mod- 
ulating seismicity and earthquake 
hazard is likely to become more 
significant as climate change 
perturbs precipitation patterns 
and as global population growth 
increases the demand for water.
The role of senescent cells in ageing


Jan M. van Deursen 


Cellular senescence has historically been viewed as an irreversible cell-cycle arrest mechanism that acts to protect against 
cancer, but recent discoveries have extended its known role to complex biological processes such as development, tissue 
repair, ageing and age-related disorders. New insights indicate that, unlike a static endpoint, senescence represents a 
series of progressive and phenotypically diverse cellular states acquired after the initial growth arrest. A deeper understand- 
ing of the molecular mechanisms underlying the multi-step progression of senescence and the development and function 
of acute versus chronic senescent cells may lead to new therapeutic strategies for age-related pathologies and extend
healthy lifespan.


Cellular senescence is a process in which cells cease dividing and
undergo distinctive phenotypic alterations, including profound chromatin
and secretome changes, and tumour-suppressor activation.
Hayflick and Moorhead first introduced the term senescence to describe the
phenomenon of irreversible growth arrest of human diploid cell strains after
extensive serial passaging in culture. Later, this particular type of senescence
(replicative senescence) was causally linked to telomere attrition, a
process that leads to chromosomal instability and promotes tumorigenesis,
supporting the original hypothesis that senescence guards against
unrestricted growth of damaged cells. Subsequent studies have reinforced
the importance of cellular senescence as a safeguard against cancer. Emerging
evidence indicates that the physiological relevance of cellular senescence
extends beyond tumour suppression into biological processes such
as embryonic development, wound healing, tissue repair and organismal
ageing. In fact, Hayflick and Moorhead initially postulated a role
for replicative senescence in ageing, but until recently this theory remained
untested. The multifunctional nature of cellular senescence raises the question
as to whether fundamentally different senescence mechanisms underlie
these diverse biological roles. This Review focuses on this and other key 
emerging concepts in the senescence field, including ‘assisted’ cell cycling, 
multi-step senescence (or senescence progression), acute versus chronic 
senescence and senescence of post-mitotic cells. How these concepts relate 
to the role of senescent cells in ageing and age-related diseases and how 
the rapidly accruing new information could be exploited to clear detri- 
mental senescent cell populations selectively to improve healthy lifespan 
are also discussed. 


Causes and effector pathways of senescence 

Research on the causes (or stresses), signalling networks and mechanisms 
underlying the various types of cellular senescence is still in its infancy 
and current insights are largely based on cell culture experiments. In addi- 
tion to telomere erosion, several other tumour-associated stresses have 
been shown to induce a senescent growth arrest in vitro, including certain
DNA lesions and reactive oxygen species (ROS). What both
these stresses have in common with telomere damage is that they activate
the DNA damage response (DDR), a signalling pathway in which ATM
or ATR kinases block cell-cycle progression through stabilization of p53
and transcriptional activation of the cyclin-dependent kinase (Cdk) inhibitor
p21. Activated oncogenes are also prominent inducers of senescence.
Oncogenic Ras acts through overexpression of Cdc6 and suppression of
nucleotide metabolism, causing aberrant DNA replication, formation of
double-stranded DNA breaks (DSBs) and activation of the DDR pathway.
However, senescence caused by E2F3 activation or c-Myc inhibition is
DDR-independent and involves p19Arf and p16Ink4a (refs 17, 22). BRAF(V600E)
is also DDR-independent and induces senescence through a metabolic
mechanism involving upregulation of mitochondrial pyruvate dehydrogenase
(PDH; Fig. 1). Several other studies underscored that senescence
is closely linked to profound metabolic changes. Furthermore, various
tumour suppressors trigger a senescent growth arrest when inactivated,
including RB, PTEN, NF1 and VHL. Of these, RB inactivation engages
the DDR, whereas the others are DDR-independent and act through
p19Arf and p16Ink4a. A notable species-specific difference is that senescence
pathways of murine cells are more dependent on p19Arf than senescence
in human cells.

Prolonged exposure to interferon-β also induces senescence, demonstrating
that chronic mitogenic signalling outside the context of neoplastic
transformation can stimulate senescence. Other, less broadly studied
inducers of senescence include epigenetic, nucleolar and mitotic spindle
stresses (Fig. 1). For example, genome-wide chromatin decompression by
exposure to histone deacetylase inhibitors triggers senescence via a p21-dependent
mechanism. A key target of epigenetic stressors that promote
senescence may be the INK4a/ARF locus, which in proliferating cells is
repressed by polycomb group-mediated H3K27 methylation and H2A-K119
ubiquitination. Nucleolar stress caused by RNA polymerase I inhibitors
triggers a robust p53-mediated senescence response. Senescence can
also be elicited by suboptimal expression of proteins implicated in spindle
formation or mitotic checkpoint control, including human TACC3 and
murine BubR1, Bub3 and Rae1, all of which engage p53 and p21 independently
of the DDR, often in combination with p16Ink4a (refs 15, 32, 33). It
is highly likely that additional stressors and mechanisms that drive cells
into senescence will be uncovered given the rapidly evolving nature of the
field. Production of proinflammatory cytokines and chemokines is emerging
as a common feature of senescent cells irrespective of the senescence-inducing
stressor or mechanism (Fig. 1).

Although the relative contributions of the p53-p21 and plo™**_RB 
effector pathways to the initial growth arrest can vary depending on the 
type of stress, both may ultimately become engaged upon sustained senes- 
cence. For example, DNA damage initially halts cell-cycle progression 
through p53-mediated induction of p21, but if lesions persist, this acti- 
vates p16" through p38-MAPK-mediated mitochondrial dysfunction 
and ROS production***’. The extent to which effector mechanisms of 
in vitro senescence apply to in vivo senescence has not been tested exten- 
sively. Fat, skeletal muscle and eye of BubR1 progeroid mice have elevated 
levels of pla", p53, p21 and ple and are subject to precocious func- 
tional decline’***. Genetic experiments using knockout strains for each of 
these tumour suppressors that dissected how senescent cells accumulate in 
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Figure 1 | Senescence-inducing stimuli and main effector pathways. 

A variety of cell-intrinsic and -extrinsic stresses can activate the cellular 
senescence program. These stressors engage various cellular signalling cascades 
but ultimately activate p53, pl6'™"“*, or both. Stress types that activate p53 
through DDR signalling are indicated with grey text and arrows (ROS elicit the 
DDR by perturbing gene transcription and DNA replication, as well as by 
shortening telomeres). Activated p53 induces p21, which induces a temporal 
cell-cycle arrest by inhibiting cyclin E-Cdk2. p16" also inhibits cell-cycle 
progression but does so by targeting cyclin D-Cdk4 and cyclin D-Cdk6 
complexes. Both p21 and p16" act by preventing the inactivation of Rb, 
thus resulting in continued repression of E2F target genes required for S-phase 
onset. Upon severe stress (red arrows), temporally arrested cells transition into 
a senescent growth arrest through a mechanism that is currently incompletely 
understood. Cells exposed to mild damage that can be successfully repaired 
may resume normal cell-cycle progression. On the other hand, cells exposed 
to moderate stress that is chronic in nature or that leaves permanent damage 
may resume proliferation through reliance on stress support pathways 

(green arrows). This phenomenon (termed assisted cycling) is enabled by 
p53-mediated activation of p21. Thus, the p53-p21 pathway can either 
antagonize or synergize with p16™““* in senescence depending on the type and 
level of stress. BRAF(V600E) is unusual in that it establishes senescence 
through a metabolic effector pathway. BRAF(V600E) activates PDH by 
inducing PDP2 and inhibiting PDK1 expression, promoting a shift from 
glycolysis to oxidative phosphorylation that creates senescence-inducing redox 
stress. Cells undergoing senescence induce an inflammatory transcriptome 
regardless of the senescence inducing stress (coloured dots represent various 
SASP factors). Red and green connectors indicate ‘senescence-promoting’ 
and ‘senescence-preventing’ activities, respectively, and their thickness 
represents their relative importance. The dashed green connector denotes a 
“senescence-reversing’ mechanism. 


these tissues and contribute to their deterioration, established that p 161nk4a 
is an effector of senescence and ageing’*. However, in contrast to in vitro 
findings, p194", p53 and p21 prevented senescence and age-related pathol- 
ogies in vivo'**°. These unexpected findings led to the concept of ‘assisted’ 
cell cycling (analogous to assisted living) in which ageing cells, coping with 
an increasing burden of macromolecular damage and other chronic stresses, 
manage to retain their proliferative potential for a while by extending cell- 
cycle duration in a p21-dependent manner to provide extra time to mend 
cellular disabilities through engagement of compensatory mechanisms or 
repair (Fig. 1). Although this concept is supported by the observation that 
the cell-cycle time of cultured primary human cells markedly increases 
with passaging”, it clearly requires more validation and generalization. Inac- 
tivation of p21 improves stem cell function in intestinal crypts and bone 
marrow in mutant mice with short telomeres”, indicating that in situations 
where irreparable damage produces a sustained and robust p53 response, 
p21 acts to promote tissue deterioration by executing senescence. 

In vitro studies of cellular senescence have traditionally been performed 
using a single senescence-inducing stimulus (that is, high-dose radiation 
or oncogenes; Fig. 1). However, in the context of organismal ageing, indi- 
vidual cells experience multiple cellular pressures, including various kinds of 
genotoxic, proteotoxic and mitotic stresses**’. Thus, to advance our under- 
standing of these processes, it will be imperative to examine how combi- 
nations of diverse senescence-promoting stressors impact the actions of 
the various downstream effector pathways and the characteristics of the 
resulting senescent phenotypes. Furthermore, while cellular senescence 
is well recognized as an in vivo tumour suppressive mechanism, its irrevers- 
ibility remains a topic of debate. However, compelling new evidence indi- 
cates that BRAF(V600E) oncogene-induced senescence (OIS) can be 
reversed by activation of phosphatidylinositol 3-kinase (PI3K) or inhibi- 
tion of PDH (Fig. 1)’**°. In addition, senescent cells have been success- 
fully dedifferentiated into pluripotent stem cells*’. 


Senescence is a multi-step evolving process 


Until recently, senescence was viewed as a static endpoint. However, several 
recent observations support the hypothesis that senescence can be a highly 
dynamic, multi-step process, during which the properties of senescent cells 
continuously evolve and diversify, much like tumorigenesis but without 
cell proliferation as a driver (Fig. 2)”. The initiating step is the transi- 
tion of temporal to stable cell-cycle arrest, which typically involves prolonged 
inhibition of Cdk-cyclin activity by p21, p16Ink4a, or both. A change in 
p53 expression from intermittent to continuous may be a critical event in 
the transition from temporal to persistent growth arrest®. 

For the progression to full senescence, it seems that lamin B1 downregula- 
tion triggers both global and local modifications in chromatin methylation’ * 
(Fig. 2). Some mammalian cell types form regions of highly condensed chro- 
matin called senescence-associated heterochromatin foci (SAHFs)*?~*?. 
SAHFs, which are enriched in chromatin modifications such as S83-HP1γ, 
HIRA, ASF1, macroH2A, H3K9me3 and γH2AX, sequester genes implicated 
in cell-cycle control, a phenomenon that seems to reinforce the senescence- 
associated growth arrest. Decondensation of (peri)centromeric satellite 
heterochromatin has been identified as a universal hallmark of senescence 
that precedes SAHF formation”. Senescence-related chromatin remodel- 
ling leads to profound transcriptional changes******. Among the assortment 
of upregulated genes is a prominent subset of genes that encode secreted 
proteins, including cytokines and chemokines with proinflammatory prop- 
erties, as well as various growth factors and proteases that together alter 
tissue structure and function. Collectively, these factors are referred to as 
the senescence-associated secretory phenotype (SASP)**”* or senescence- 
messaging secretome (SMS)°’. The SASP is one of the key characteristics 
that distinguish senescent cells from quiescent, terminally differentiated, 
and other types of non-proliferating cells (Fig. 2). In certain cases, the SASP 
is dependent on persistent DNA damage signalling”*, such as that created 
through a positive feedback loop between DDR signalling and ROS”. This 
loop was uncovered using human fibroblast lines in which dominant- 
negative TRF2 or high dose ionizing radiation induced telomere-dependent 
and telomere-independent DDR signalling, respectively. Both types of DDR 
signalling were found to cause mitochondrial dysfunction and production 
of ROS which led to new DNA damage and continued DDR signalling. 
Consistent with this, primary human fibroblasts overexpressing p16Ink4a 
or p21 undergo senescent growth arrest but fail to activate the DDR and 
do not produce a SASP*. Importantly, several senescence-inducing stimuli 
produce SASPs independent of DNA damage (Fig. 1), implying the existence 
of DDR-independent mechanisms'"!*”*. 

Given the intricate nature of the SASP, it is not surprising that senescent 
cells impact various biological processes that involve paracrine signalling, 
including cell proliferation, angiogenesis”, inflammation”, epithelial-to- 
mesenchymal transition (EMT)*', wound healing”’, and other types of tissue 
repair'*. Several SASP components, including IL-6, IL-8, WNT16B and 
GROα, also act in an autocrine fashion in the context of OIS, presumably 
to help establish a persistent growth arrest *. Importantly, SASP factors 
vary in distinct cell types and under different senescence-inducing stressors”. 
This plasticity within SASP composition predicts variability with respect 
to the biological processes impacted by different kinds of senescent cells 
(Fig. 2). Proinflammatory cytokines and chemokines are among the SASP 
components that are highly conserved across cell types and senescence- 
inducing stimuli”, suggesting that attracting immune cells and inducing 
local inflammation are common properties of senescent cells. However, 
accumulation of senescent cells is not always accompanied by immune 
cell infiltration and inflammation, as exemplified by melanocytic nevi®. 

Cultured cells usually reach senescence within several weeks after exposure 
to senescence-inducing stressors, but remain viable for months thereafter”. 
Senescent cells continue to evolve even after extended periods of culture, 
thereby progressing to a stage that has been termed ‘deep’ or ‘late’ senes- 
cence (Fig. 2). This phenomenon is evidenced by a dramatic increase in 
the transcription of transposable elements, including members of the L1, 
ALU and SVA transposon families, which occurs several months after senes- 
cence onset***. These newly synthesized retrotransposon transcripts can 
indeed engage in active transposition and accumulate in late-senescent cell 
genomes. Increased retrotransposon activity is associated with senescence- 
associated opening of gene-poor heterochromatic regions where these ele- 
ments reside**. A second process driving continued change in senescent 
cells is characterized by the extrusion of chromatin into the cytoplasm, 
resulting in the formation of cytoplasmic chromatin fragments (CCFs)™*. 
CCFs are strongly positive for H3K27me3 and γH2AX, contain DNA, and 
are processed via lysosome-mediated proteolysis, resulting in overall histone 
loss. Both retrotransposon activation and chromatin budding are examples 
of continued genomic and epigenomic remodelling in senescent cells. 
Both processes seem random in nature and are therefore likely to drive 
transcriptome diversity among senescent cells, even in those resulting from 
a common stressor. Because transcriptional activity is a key determinant of 
secretome composition, it is reasonable to assume that the SASPs of these 
cells diversify to some degree through these changes. With the concept of 
senescence progression in cultured cells solidifying, it becomes essential 
to validate its in vivo importance. 

Figure 2 | Hypothetical multi-step senescence 
model. Mounting evidence suggests that cellular 
senescence is a dynamic process driven by 
epigenetic and genetic changes. The initial step 
represents the progression from a transient to a 
stable cell-cycle arrest through sustained activation 
of the p16Ink4a and/or p53-p21 pathways. The 
resulting early senescent cells progress to full 
senescence by downregulating lamin B1, thereby 
triggering extensive chromatin remodelling 
underlying the production of a SASP. Certain 
components of the SASP are highly conserved 
(grey dots), whereas others may vary depending on 
cell type, nature of the senescence-inducing 
stressor, or cell-to-cell variability in chromatin 
remodelling (red and green dots). Progression to 
deep or late senescence may be driven by additional 
genetic and epigenetic changes, including 
chromatin budding, histone proteolysis and 
retrotransposition, driving further transcriptional 
change and SASP heterogeneity (yellow, magenta, 
pink and blue dots). It should be emphasized 
that although the exact nature, number and order 
of the genetic and epigenetic steps occurring 
during senescent cell evolution are unclear, it is 
reasonable to assume that the entire process is 
prone to SASP heterogeneity. The efficiency with 
which immune cells (yellow) dispose of senescent 
cells may be dependent on the composition of 
the SASP. Interestingly, the proinflammatory 
signature of the SASP can fade due to expression of 
particular microRNAs late into the senescence 
program, thereby perhaps allowing evasion of 
immuno-clearance”. 


Acute versus chronic senescence 


The diverse nature of the processes in which senescence has been impli- 
cated, ranging from embryonic development’’ to wound healing”, tissue 
repair'‘, cancer and ageing"®, raises the question of whether the properties 
of the senescent cells involved in these activities are fundamentally different. 
If so, what would be the underlying molecular mechanisms? Valuable clues 
to these questions can be inferred from the apparent differences in senes- 
cence kinetics between these processes. In the context of ageing, cells chron- 
ically accumulate macromolecular damage and may become increasingly 
dependent on cell-cycle checkpoints and stress-relief mechanisms to retain 
proliferative potential (assisted cycling)'*°°®. Ultimately, more and more 
of these cells may stably arrest and transition into a senescent state, referred 
to here as chronic senescence (Fig. 3). 

Chronic senescence is different from a process like wound healing, by 
which upon wound closure, myofibroblasts suddenly undergo senescence 
(acute senescence) to limit excessive fibrosis at the site of injury’ (Fig. 3). 
Myofibroblast senescence here is induced by the extracellular matrix protein 
CCN1, which acts through integrin α6β1 and HSPG-mediated activation 
of the RAC1-dependent NADPH oxidase 1 to produce a robust and sus- 
tained accumulation of ROS”’. Senescent myofibroblasts limit fibrosis by 
promoting the degradation of matrix components through the SASP factors 
they produce. Importantly, acute myofibroblast senescence also has a role 
in repairing damaged organs such as the liver and thus is likely to repre- 
sent a more common mechanism to limit fibrosis'*. 
Similar to skin repair and recovery from liver injury, senescent cell 
induction is also acute and spatiotemporally controlled in uterine neo- 
vascularization, a developmental process that takes place at the site of 
embryo implantation to supply the embryo with maternal blood’’. HLA- 
G secreted by embryonic trophoblast cells induces senescence in nearby 
natural killer cells, which then start to produce SASP components to promote 
local angiogenesis and vascular remodelling. Senescence has also been iden- 
tified as a prominent mechanism for remodelling of various tissues during 
mouse embryogenesis, including the mesonephros, the endolymphatic sac, 
the apical ectodermal ridge, and the neural roof plate’”'”. Developmental 
senescence is p21-dependent, but p53- and DDR-independent, and shares 
several features with OIS, including a common gene expression signature 
and senescence-associated β-galactosidase activity. Interestingly, OIS itself 
is triggered by a single defined stimulus and established with fast kinetics, 
which would qualify it as acute senescence. However, cells undergoing OIS 
are not always cleared by the immune system. For instance, senescent cells 
in human melanocytic nevi are highly persistent. 

Thus, senescence induction in tissue repair and development seems to 
be a scheduled or programmed process triggered by specific stimuli that 
target particular types of cells (Fig. 3). In contrast, during ageing-related 
senescence, the switch from temporal to persistent cell-cycle arrest appears 
unscheduled and stochastic in nature, probably involving the combined 
effects of distinct senescence-inducing stressors acting simultaneously on 
a cell. The kinetics and efficiency of senescent-cell clearance may constitute 
another key difference between acute and chronic senescence. During repair 
and embryogenesis, disposal of senescent cells seems very efficient and 
under strict temporal control'**. Conversely, ageing-related senescent cells 
may be more persistent due to deterioration of the immune system with 
ageing”””’, but further experiments are needed to refine our understanding 
of the relationship between senescence and the (ageing) immune system. 

Senescence induced by chemotherapeutics or radiation in the context 
of cancer treatment may be a combination of acute and chronic senescence 
(Fig. 3). Acute senescence would apply to cells that generate a sustained 
DDR as a rapid response to overwhelming genomic damage”. In contrast, 
chronic senescence would pertain to cells experiencing mild genotoxic 
stress that can initially be managed through engagement of stress support 
pathways. However, as further macromolecular damage occurs over time, 
these cells may eventually transition from a pre-senescent state to a per- 
sistent cell cycle arrest’”’. An interesting untested hypothesis is that cancer 
therapy-induced senescence contributes to accelerated tissue and organ 
deterioration in cancer survivors”*. 


Senescence of post-mitotic cells 


Most cells in mammals are post-mitotic and the question that has been 
raised is whether these cells can obtain key characteristics of senescent cells. 
Post-mitotic neurons in various parts of human and mouse brains are known 
to accumulate high amounts of DNA damage’®. Recent research has revealed 
that these neurons exhibit several additional senescence-associated prop- 
erties, including heterochromatinization, synthesis of proinflammatory 
interleukins, and high senescence-associated β-galactosidase activity’”*. As 
with mitotic cells that undergo senescence in response to sustained DNA 
damage, these phenotypes develop in a p21-dependent manner, further 
associating them with senescence (Fig. 4). Senescence-like features have 
also been reported for adipocytes of mice on a high-fat diet”®, suggesting 
that post-mitotic cell senescence may be a broader phenomenon. It will 
be important to confirm that these terminally differentiated cells produce 
a SASP that negatively impacts the functionality of neighbouring cells 
(Fig. 4), and to explore whether differentiated cells with senescent cell prop- 
erties accumulate in tissues other than brain and fat. 


Senescence in ageing and age-related disease 


The absence of senescence-specific markers has hampered efforts to char- 
acterize senescent cells that accumulate in vivo in tissues and organs. The 
most reliable in situ detection methods that are currently available screen 
for multiple semi-selective senescent cell characteristics. These include, but 
are not limited to, high levels of p16Ink4a, p21, macroH2A, IL-6, phosphorylated 
p38 MAPK, DSBs, and senescence-associated β-galactosidase activity. The 
use of such methods has provided convincing evidence that senescent cells 
indeed accumulate in tissues of humans, primates, and rodents with age” *", 
as well as at sites of tissue injury and remodelling'”"*. Furthermore, cells 
with senescent cell properties can be found in the affected tissues of patients 
with age-related diseases such as osteoarthritis, pulmonary fibrosis, ath- 
erosclerosis, and Alzheimer’s disease’*”. 

Studies on the relevance of in vivo senescence in ageing and age-associated 
diseases have also been complicated by the fact that key downstream effec- 
tors of senescence-inducing stressors such as p16Ink4a and p53 are tumour 
suppressors that, when disrupted in mice, cause death from cancer at a 
young age**. p53 provides additional complications to these analyses as, 
unlike p16Ink4a, this transcription factor also mediates apoptosis, making 
it difficult to assign potential ageing-related phenotypic changes in p53-null 
animals to the senescence program* *°. Two consecutive studies in BubR1 
progeroid mice, in which p16Ink4a-positive senescent cells were targeted in 
different ways, provided the first direct support for Hayflick and Moorhead’s 
early concept that senescent cells drive age-related pathologies’”!>"®. In 
the first study, genetic inactivation of p16Ink4a prevented the formation 
of senescent cells in skeletal muscle, eye and fat, significantly attenuating 
the onset of age-related pathologies in these tissues'’. In the subsequent 
study, which produced a phenocopy of genetic p16Ink4a ablation, p16Ink4a- 
positive senescent cells were allowed to accumulate but were consistently 
eliminated from weaning age onwards by the use of a transgene, termed 
INK-ATTAC, that selectively induced apoptosis in these cells upon admin- 
istration of the synthetic drug AP20187 (ref. 16). Late-life clearance of 
senescent cells attenuated progression of already-established age-related 
disorders in skeletal muscle and fat, yet was unable to revert them. 

Figure 3 | Acute and chronic senescent cells. The 
conceptual model in which senescent cells are 
subdivided into two main classes based on kinetics 
of senescence induction and functionality. Acute 
senescent cells seem to mostly be part of tightly 
orchestrated biological processes (that is, wound 
healing, tissue repair, embryonic development) to 
halt expansion of certain cells and/or produce a 
SASP with defined paracrine functions. Acute 
senescence is induced through cell-extrinsic stimuli 
that target a specific population of cells in the 
tissue. Acute senescent cells self-organize their 
elimination through SASP components that attract 
various types of immune cells. Induction of chronic 
senescence occurs after periods of progressive 
cellular stress or macromolecular damage when 
tarry cycling transitions into a stable cell-cycle 
arrest. Chronic senescence is not programmed 
and does not seem to target specific cell 
types. Conceivably, owing to age-related 
immunodeficiency or production of less 
proinflammatory SASPs, immune cells may 
inefficiently eliminate chronic senescent cells, 
allowing continuation of multi-step senescence. 
Senescence induced during cancer therapy may 
initially be acute and later chronic in nature. 

Figure 4 | Mechanisms of tissue and organ deterioration by cellular 
senescence. Cellular senescence is thought to contribute to age-related tissue 
and organ dysfunction and various chronic age-related diseases through 
various mechanisms. In a cell-autonomous manner, senescence acts to deplete 
the various pools of cycling cells in an organism, including stem and progenitor 
cells. In this way, senescence interferes with tissue homeostasis and 
regeneration, and lays the groundwork for its cell-non-autonomous 
detrimental actions involving the SASP. There are at least five distinct paracrine 
mechanisms by which senescent cells could promote tissue dysfunction, 
including perturbation of the stem cell niche (causing stem cell dysfunction), 
disruption of extracellular matrix, induction of aberrant cell differentiation 
(both creating abnormal tissue architecture), stimulation of sterile tissue 
inflammation, and induction of senescence in neighbouring cells (paracrine 
senescence). An emerging yet untested concept is that post-mitotic, terminally 
differentiated cells that develop key properties of senescent cells might 
contribute to ageing and age-related disease through the same set of paracrine 
mechanisms. 

An important question that needs to be addressed is how senescence 
promotes age-related tissue dysfunction (Fig. 4). One scenario is that senes- 
cence contributes to the overall decline in tissue regenerative potential that 
occurs with ageing. This idea is supported by the observation that pro- 
genitor cell populations in both skeletal muscle and fat tissue of BubR1 
progeroid mice are highly prone to cellular senescence”. In addition to 
acting on stem cells in a cell-autonomous fashion by establishing a per- 
sistent growth arrest, senescence could act to disrupt the local stem-cell 
niche non-autonomously through the SASP*’* (Fig. 4). Although this 
concept remains to be tested in vivo, the profound negative impact that 
the aged cellular microenvironment has on stem cell functionality is under- 
scored by the discovery that the regenerative potential of old stem cells 
markedly improves when exposed to a young systemic environment via 
parabiotic pairing*”””. 

Other SASP-based mechanisms may also contribute to tissue dysfunc- 
tion. For example, proteases chronically secreted by senescent cells may 
perturb tissue structure and organization by cleaving membrane-bound 
receptors, signalling ligands, extracellular matrix proteins or other com- 
ponents in the tissue microenvironment”*”’ (Fig. 4). In addition, other 
SASP components, including IL-6 and IL-8, may stimulate tissue fibrosis 
in certain epithelial tissues by inducing EMT®"”! (Fig. 4). Chronic tissue 
inflammation, which is characterized by infiltration of macrophages and 
lymphocytes, fibrosis and cell death, is associated with ageing and has a 
causal role in the development of various age-related diseases”. One idea, 
which remains untested, is that senescent cells that accumulate with age- 
ing and that are present at sites of age-related pathologies promote this 
type of inflammation through the proinflammatory growth factors, cyto- 
kines and chemokines they secrete (Fig. 4). These may include GM-CSF, 
GROα, IL-1, IL-6, IL-8, macrophage inflammatory proteins (MIPs), as 
well as monocyte chemo-attractant proteins (MCPs)"**”°. Together with 
matrix metalloproteinases, proinflammatory SASP components are thought 
to create a tissue microenvironment that promotes survival, proliferation 
and dissemination of neoplastic cells, which may explain, at least in part, 
why cancer rates markedly increase beyond middle age’**™*. Finally, the 
SASP may intensify age-related tissue deterioration through paracrine 
senescence, a recently discovered mechanism by which senescent cells spread 
the senescence phenotype to healthy neighbouring cells through secre- 
tion of IL-1β, TGFβ and certain chemokine ligands (Fig. 4)”””’. 

Why senescent cells accumulate in tissues and organs with age is ano- 
ther key open question. One possibility is that the rate with which senes- 
cent cells are produced might increase over time. In support of this idea, 
several investigations have demonstrated that various stimuli that induce 
senescence increase with ageing**”*”*. If combined cellular stresses were 
to drive senescence, it would take a long time for these to accumulate. Alter- 
natively, the efficiency with which senescent cells get eliminated may decrease 
with ageing. In fact, senescent cells can be killed and disposed of by immune 
cells, as was elegantly demonstrated in mice that undergo senescence in 
the context of liver fibrosis and hepatocellular carcinogenesis*”*. These 
observations raised the possibility that senescent cells are armed with a 
self-elimination program that proceeds by attracting both adaptive and 
innate immune cells, including T cells, macrophages and natural killer cells, 
through the secretion of proinflammatory cytokines and chemokines®*””. 
This program is likely to be affected in ageing humans and rodents, in which 
the immune system undergoes a complex series of changes in both the innate 
and adaptive immunity that culminate in age-associated immunodeficiency”. 
This may include a reduced efficiency of senescent-cell clearance. Indeed, 
age-related haematopoietic stem cell dysfunction compromises the immune 
system and may thus be an important contributor to the late-life systemic 
increases in senescent cells””’*. Furthermore, given that senescence may 
be a dynamic process rather than a static state, it is conceivable that the 
ability to self-eliminate through immune cells becomes compromised as 
senescent cells evolve. 
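
The two explanations considered above (rising production of senescent cells versus declining clearance) can be contrasted in a toy simulation. The sketch below is purely illustrative and is not a model taken from the studies discussed here: the update rule, the function name simulate_burden and every rate value are assumptions chosen only to show that either change alone is enough to raise the senescent-cell burden over time.

```python
def simulate_burden(steps, production, clearance, initial=0.0):
    """Toy update rule (an assumption, not a published model):
    burden[t+1] = burden[t] + production(t) - clearance(t) * burden[t]
    """
    burden = initial
    history = []
    for t in range(steps):
        burden = burden + production(t) - clearance(t) * burden
        history.append(burden)
    return history


STEPS = 100  # arbitrary number of time steps

# Baseline (hypothetical values): constant production and constant clearance.
baseline = simulate_burden(
    STEPS,
    production=lambda t: 1.0,
    clearance=lambda t: 0.5,
)

# Scenario A (hypothetical): production rises with age, clearance is unchanged.
rising_production = simulate_burden(
    STEPS,
    production=lambda t: 1.0 + 0.05 * t,
    clearance=lambda t: 0.5,
)

# Scenario B (hypothetical): production is unchanged, clearance declines with age
# down to a floor, standing in for age-associated immunodeficiency.
declining_clearance = simulate_burden(
    STEPS,
    production=lambda t: 1.0,
    clearance=lambda t: max(0.05, 0.5 - 0.005 * t),
)

for name, series in [
    ("baseline", baseline),
    ("rising production", rising_production),
    ("declining clearance", declining_clearance),
]:
    print(f"{name}: final burden = {series[-1]:.2f}")
```

In this sketch both scenarios end with a higher burden than the baseline, which is why accumulation data alone would not distinguish the two explanations; measuring production and clearance separately would be needed.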

Another important consideration is whether chronic and acute senes- 
cence can both play a role in ageing and age-related disease. Acute senes- 
cent cells are part of tightly orchestrated biological processes in which they 
have narrowly defined roles and a temporal presence both dictated by the 
composition of the SASP, limiting their ability to accumulate with ageing. 
In contrast, chronic senescent cells which develop after a prolonged period 
of gradually increasing cellular stresses are expected to exhibit high SASP 
heterogeneity due to a more complex and diverse spectrum of effector path- 
ways involved in establishing this type of longer lasting senescent state. 
SASP heterogeneity may therefore be a mechanism to create subsets of 
senescent cells that are highly resistant to immune clearance and drive 
tissue degeneration. 


Senescent-cell clearance and future directions 


Studies in BubR1 progeroid mice provided proof-of-principle that clear- 
ance of senescent cells can delay age-related degenerative pathologies’®. 
This, together with the lack of overt detrimental side effects associated 
with long-term clearance in BubR1 mutant mice, suggests that targeting 
senescent cells for destruction might be an effective therapeutic strategy 
for treatment of age-related diseases or improvement of healthy lifespan. 
Evidently, sophisticated approaches developed for selective eradication 
of cancer cells provide an invaluable blueprint for the development of 
molecular-targeted therapies against senescent cells. The problem of drug 
resistance that has plagued the cancer field is unlikely to apply to senolytic 
agents because rare residual senescent cells that remain after treatment 
would not be able to amplify through division. 

However, although senescent-cell removal represents an attractive ther- 
apeutic avenue, there are many unknowns and potential pitfalls along this 
route. For example, our current knowledge about the rates and spatio- 
temporal patterns that drive the accumulation of senescent cells in both 
humans and animal models during normal ageing and in age-related dis- 
eases is limited. Another gap in knowledge relates to the degree of pheno- 
typic heterogeneity (that is, SASP composition) between senescent cells 
that accumulate in vivo, not only between the acute and chronic senescent 
cells but also within these two classes. Also, it will be imperative to deter- 
mine the impact of senescent cell clearance on the health and lifespan of 
normal mice, particularly now that evidence is mounting that senescence 
is beneficial for tissue development and repair. 

Another important consideration is whether the mouse is a reliable 
model for recapitulating the physiological effects of senescent cell accu- 
mulation and clearance that occur in humans. One prominent senescence- 
inducing stressor, telomere attrition, is specific to humans, and may be 
responsible for a higher baseline level of senescence in our species. If, similar 
to telomere attrition, other senescence-inducing mechanisms were indeed 
more prevalent in humans than in mice, the therapeutic effect of senescent 
cell clearance might be even more robust than in mice. Conversely, we 
should consider that, in spite of its potential beneficial effects, the removal 
of high percentages of senescent cells could have undesirable outcomes to 
human health by triggering atrophy and tissue dysfunction. In addition, 
the most illuminating experiments in mice relied on targeting p16Ink4a- 
positive cells for elimination, a population that probably represents only a 
subset of cells undergoing senescence. Thus, the effects of clearing either 
p16Ink4a-negative senescent cells or, collectively, all senescent cells remain 
to be determined. In addition, cultured human and mouse cells differen- 
tially rely on the p53 and p16Ink4a pathways for induction of senescence”’, 
underscoring the need for carefully validating information from mouse 
studies before extrapolating its results to the human situation. We should 
also consider that although the existence of multiple senescent-cell sub- 
types offers at the same time challenges and opportunities, targeting each 
of these populations separately might have either beneficial or detrimen- 
tal effects. 

The field of experimental therapeutics as it relates to senescence is a 
nascent yet promising area of investigation. Besides small molecules that 
target senescent cells, a potentially promising and straightforward bio- 
therapeutic approach would be to activate or reinforce the immune response 
against senescent cells. This approach will require a deeper understand- 
ing of the extent to which the immune system disposes of senescent cells 
as well as the molecular and cellular mechanisms underlying this process. 
Potential caveats are that immunodeficiency is not a main contributor 
to age-related increases in senescent cells, or that chronically senescent 
cells are enriched in SASP components that provide resistance to immune 
clearance”. Approaches exploiting the adaptive immune system to mount 
effective immunity against senescent cells may have limited feasibility owing 
to a potential lack of senescent-cell-specific antigens. 

Undoubtedly, the next decade will see a tremendous expansion of data 
on the mechanisms, characteristics and functions of in vivo senescence, as 
well as the use of this information to ameliorate human age-related diseases 
and promote healthy lifespan. 
Genome-defence small RNAs exapted for 
epigenetic mating-type inheritance 

In the ciliate Paramecium, transposable elements and their single-copy remnants are deleted during the development of 
somatic macronuclei from germline micronuclei, at each sexual generation. Deletions are targeted by scnRNAs, small RNAs 
produced from the germ line during meiosis that first scan the maternal macronuclear genome to identify missing sequences, 
and then allow the zygotic macronucleus to reproduce the same deletions. Here we show that this process accounts for the 
maternal inheritance of mating types in Paramecium tetraurelia, a long-standing problem in epigenetics. Mating type E 
depends on expression of the transmembrane protein mtA, and the default type O is determined during development by 
scnRNA-dependent excision of the mtA promoter. In the sibling species Paramecium septaurelia, mating type O is determined 
by coding-sequence deletions in a different gene, mtB, which is specifically required for mtA expression. These independently 
evolved mechanisms suggest frequent exaptation of the scnRNA pathway to regulate cellular genes and mediate
transgenerational epigenetic inheritance of essential phenotypic polymorphisms.


Ciliates are complex unicellular eukaryotes that use different types of nuclei 
within the same cytoplasm to separate germline and somatic functions.
The diploid micronuclei (MICs) undergo meiosis to provide gametic nuclei 
during sexual events, but their genome is not expressed. Genes are ex- 
pressed from the polyploid macronucleus (MAC), which is not transmit- 
ted across sexual generations. After meiosis and fertilization, the parental 
MAC is lost and replaced by a new one that develops from a mitotic copy 
of the zygotic nucleus. Macronuclear development involves extensive 
rearrangements of the germline genome, including the elimination of 
virtually all transposable elements and other repeats. Furthermore, in
P. tetraurelia ~45,000 short, single-copy internal eliminated sequences
(IESs) are precisely excised from coding and non-coding sequences by
the domesticated transposase Pgm. IESs are invariably flanked by two
5'-TA-3' dinucleotides which recombine into one after excision. A short
consensus adjacent to the TAs (5'-TAYAGYNR-3') is reminiscent of the
ends of Tc1/mariner elements, and a recent study provided support for
the hypothesis that IESs are degenerate remnants of ancient transposable
element insertions.
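
The TA-flanked excision and the degenerate end consensus described above can be illustrated with a toy model. The following is a minimal Python sketch, not the authors' analysis pipeline: the function names, the toy sequence and the coordinates are hypothetical, and only the left IES end is checked against the consensus.

```python
import re

# IUPAC degenerate codes needed for the consensus 5'-TAYAGYNR-3'
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "Y": "[CT]", "R": "[AG]", "N": "[ACGT]"}

def consensus_regex(consensus):
    """Translate an IUPAC consensus string into a compiled regular expression."""
    return re.compile("".join(IUPAC[base] for base in consensus))

IES_END = consensus_regex("TAYAGYNR")

def matches_ies_end(seq):
    """True if seq begins with a match to the IES end consensus."""
    return IES_END.match(seq) is not None

def excise(mic, left_ta, right_ta):
    """Remove a TA-flanked segment from a germline (MIC) sequence.

    left_ta and right_ta are the 0-based positions of the two flanking TA
    dinucleotides. The left TA and the IES body are removed, so that a
    single TA remains at the junction.
    """
    if mic[left_ta:left_ta + 2] != "TA" or mic[right_ta:right_ta + 2] != "TA":
        raise ValueError("segment must be flanked by TA dinucleotides")
    return mic[:left_ta] + mic[right_ta:]

# Hypothetical toy sequence: flank + TA + IES body + TA + flank
toy_mic = "GGC" + "TA" + "CAGTTGCAT" + "TA" + "CCG"
toy_mac = excise(toy_mic, 3, 14)
print(toy_mac, matches_ies_end(toy_mic[3:]))
```

In this toy case the two TAs collapse into one at the junction, and the left boundary happens to match the consensus; real IES ends match it only loosely, which is why the consensus alone cannot specify the excision pattern.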

All intragenic IESs identified so far must be excised to reconstitute functional
genes in the MAC. However, the poorly conserved IES end consensus
is not sufficient to specify the excision pattern genome-wide, and
a specific class of small RNAs is required to identify some IESs on the
basis of their absence from the parental MAC. In the current ‘genome
scanning’ model, scnRNAs are produced from most of the germline genome
during MIC meiosis and are then filtered by pairing interactions
with nascent transcripts in the parental MAC, which acts as a sponge to
remove matching scnRNAs from the active pool. Those that cannot find
a match remain free to target homologous sequences in the zygotic MAC
when it develops, thereby recruiting the IES excision machinery. This
RNA-mediated genomic subtraction thus reproduces the deletions
observed in the parental MAC and can account for epigenetic inheritance
of alternative rearrangement patterns, such as retention of a given IES in
the MAC, or deletion of a given gene, across sexual generations.
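
The subtraction logic of the genome-scanning model can be sketched as a set difference. This is a deliberately simplified Python illustration under stated assumptions: scnRNAs are modelled as fixed-length substrings of one strand, pairing is exact matching, and the sequences, the word length `k` and the function names are all hypothetical.

```python
def kmers(seq, k):
    """All substrings of length k, standing in for small RNAs from one strand."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}

def active_scnrnas(mic, maternal_mac, k):
    """scnRNAs made from the germline minus those absorbed by the maternal MAC."""
    return kmers(mic, k) - kmers(maternal_mac, k)

# Hypothetical toy genome: a segment that may or may not be kept in the MAC
left, segment, right = "ACGTACGGAT", "TTTTCCCCGGGG", "CATCATGCAA"
mic = left + segment + right
mac_retaining = left + segment + right   # maternal MAC kept the segment
mac_deleted = left + right               # maternal MAC lost the segment

probe = "TTCCCC"  # a small RNA lying entirely within the segment
pool_retaining = active_scnrnas(mic, mac_retaining, k=6)
pool_deleted = active_scnrnas(mic, mac_deleted, k=6)

# The probe stays active, and can target excision in the zygotic MAC,
# only when the maternal MAC lacks the segment.
print(probe in pool_retaining, probe in pool_deleted)
```

Because each cell sorts its own pool against its own maternal MAC, the outcome in the new MAC copies the maternal state, which is the sense in which the rearrangement pattern is inherited through the cytoplasm.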

In Paramecium, conjugation (the reciprocal fertilization of cells of 
opposite mating types) does not allow any significant exchange of cytoplasm
between the mates, so that the pools of scnRNAs produced during
meiosis are independently sorted in each cell. After fertilization, each 
developing MAC will thus reproduce the particular rearrangements pre- 
sent in the old MAC of its own cytoplasmic parent. This mechanism 
might underlie the maternal (cytoplasmic) inheritance of mating types 
in P. tetraurelia, one of the earliest cases of transgenerational epigenetic 
inheritance in any eukaryote. 

Although mating types were discovered in 1937 (ref. 17), so far the only 
available test relies on the ability of type O (odd) to agglutinate with type E 
(even) when vegetative cells become sexually reactive, a physiological state 
induced by mild starvation. Agglutination is a prerequisite for conjugation 
and occurs on contact through adhesion of ciliary membranes. Mutational 
analyses showed that several genes are specifically required for expression 
of type E; mutations in these genes result in a constitutive O phenotype,
which thus seems to be a default state. In P. tetraurelia, mating types
are not genetically determined in the MIC. Each new MAC becomes
determined for one type during its development, and remains the same
throughout vegetative growth of the derived clone. The O/E alternative is
not random, but maternally inherited; experiments showed that mating-type
determination in the developing zygotic MAC is controlled through
the cytoplasm by the maternal MAC. A pleiotropic mutation enforcing
constitutive determination for type E was later found to impair a limited
subset of genome rearrangements, suggesting that type O is normally
determined by the rearrangement of one of the genes required for E
expression.

Figure 1 | Structure and expression of mtA and related proteins.
a, Recognizable protein features. b, Northern blot analysis of expression of mtA,
mtAL1 (GSPATG00025159001) and mtAL2 (GSPATG00002922001). The
panels on the left compare total RNA samples from sexually reactive cultures of
the two mating types (reactive, O and E). In the panels on the right, RNA
samples were extracted from: V, vegetative, exponentially growing cells; R,
mildly starved and sexually reactive; S, starved for 48 h and no longer reactive.
The three mRNAs migrate just above the large rRNA, as can be seen from the
pattern of background hybridization.


mtA rearrangements determine mating type 

To identify the putative mating-type-determining gene, we used a whole- 
genome microarray to compare the transcriptomes of sexually reactive 
cells of both mating types (not shown). The gene found to have the largest 
E/O expression ratio, GSPATG00017533001, encodes a 1,275-amino-acid 
protein with a signal peptide and cysteine-rich furin-like repeats followed 
by 5 transmembrane segments at the carboxy terminus (Fig. 1a and Supple- 
mentary Data 1a); the amino-terminal part is predicted to be outside the 
plasma membrane. This gene was later shown to be mtA (ref. 19) (see 
below). Northern blot analyses confirmed that it is expressed only in 
E cells and further showed that expression is limited to sexual reactivity, 
as transcripts could not be detected in exponentially growing cells or in 
over-starved cells (Fig. 1b). Genes encoding structurally similar proteins 
were also found to be specifically expressed in sexually reactive cells, but 
in both mating types (Fig. 1b). RNA interference (RNAi)-mediated silen- 
cing of mtA in E cells resulted in the default O phenotype during sexual 
reactivity (as determined by their capacity to agglutinate with E but not 
with O tester lines), indicating that mtA is required for E expression 
(Supplementary Table 1). 

To understand the molecular basis for mating-type-E-specific express- 
ion, we sequenced the mtA gene from the MACs of O and E cells, and the 
unrearranged MIC version. The MIC gene is interrupted by four IESs that 
are excised in both MAC types; however, a 195-base-pair (bp) segment 
containing the transcription start site and the first 26 bp of the coding 
sequence was found to be excised as an IES in mating type O MACs, but 
retained in E MACs (Fig. 2a-c and Supplementary Data 1a). This segment
contains the mtA promoter, as indicated by microinjection of different
constructs into the MACs of O cells (Extended Data Fig. 1a-c): its
presence upstream of the coding sequence was sufficient for transformed 
clones to express mating type E instead of mating type O. An mtA-GFP 
fusion protein was detected in cilia of the anteroventral surface, although 
an excess of GFP fluorescence was seen in cytoplasmic structures, prob- 
ably the ER (Fig. 2d, e). The fusion protein could be detected on cilia with 
anti-GFP antibodies in fixed, non-permeabilized cells (Fig. 2f), confirm- 
ing that its N-terminal part is exposed outside of the ciliary membrane. 

When mtA-transformed clones were taken through autogamy (a self- 
fertilization sexual process), the induced type E was robustly transmitted 
to progeny (Extended Data Fig. 1b). As observed for maternally controlled 
IESs, a plasmid containing only the 195-bp segment was sufficient
to inhibit excision of the homologous sequence during development of 
zygotic MACs, causing the progeny of transformed O cells to switch to E 
(Extended Data Fig. 1d). Retention of the mtA promoter thus recapitulates 
both aspects of developmental determination for E, namely the capacity to 
express mating-type E during the vegetative phase and the capacity, after 
sexual events, to direct determination of new MACs for the same type. 


448 | NATURE | VOL 509 | 22 MAY 2014 


Figure 2 | Structure of mtA in the MIC and in the MACs of E and O cells.
a, Black boxes are IESs; the 195-bp segment (grey box) is excised only in O
MACs. A small unannotated gene downstream of mtA is conserved in other
species. b, Southern blot of EcoRI-digested total DNA, hybridized with an mtA
probe (PCR5) revealing fragments of 1,945 and 2,140 bp in O and E clones.
c, PCR amplification of the mtA 5' end (PCR1) on independent pools of O and
E clones (266 and 461 bp, respectively). pO and pE, control PCRs on cloned
MAC versions; M, size markers. d-f, Localization of an mtA-GFP fusion
protein, with GFP inserted between furin-like repeats and transmembrane
segments. d, DIC image. Scale bar, 20 μm. e, Intracellular confocal optical
section. Some GFP fluorescence (green) is detected in cilia of the anterior (top),
but not posterior, part of the cell. The MAC is stained with Hoechst (blue).
f, Confocal image of the ventral cell surface, showing the opening of the oral
apparatus. The fixed, non-permeabilized cell was labelled with anti-GFP and
secondary antibodies (red).
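
The fragment sizes quoted in this legend can be checked against the 195-bp segment with simple arithmetic. The short Python sketch below only restates numbers given in the text; the dictionary layout and function name are illustrative choices.

```python
SEGMENT_BP = 195  # size of the excised mtA promoter segment given in the text

# Fragment sizes (bp) quoted in the Figure 2 legend for E and O clones
FRAGMENTS = {
    "Southern blot (EcoRI)": {"E": 2140, "O": 1945},
    "PCR1": {"E": 461, "O": 266},
}

def size_difference(sizes):
    """Length lost in O relative to E for one assay."""
    return sizes["E"] - sizes["O"]

for assay, sizes in FRAGMENTS.items():
    diff = size_difference(sizes)
    print(f"{assay}: E - O = {diff} bp; equals segment: {diff == SEGMENT_BP}")
```

Both assays give the same difference, as expected if a single 195-bp excision distinguishes the O and E macronuclear versions of the locus.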


Expression and inheritance uncoupled 


Previous mutational analyses showed that the expression of mating types 
during sexual reactivity can be uncoupled from the mechanism that ensures
their transgenerational inheritance. The recessive mutations
mtA°, mtB° and mtC° all preclude expression of type E and restrict cells
to the default O phenotype, but they do not affect the maternal inher- 
itance of mating-type determination: mutant homozygotes formed in an 
E cytoplasmic lineage, despite expressing O, will keep the memory of E 
determination for an indefinite number of sexual generations, as shown 
by the fact that their progeny switches back to E expression when the 
wild-type allele is reintroduced by conjugation (Extended Data Fig. 2a-d).

Figure 3 | Molecular analysis of expression mutants. a, mtA expression in
E-determined mtA°, mtB° and mtC° homozygotes. The top panel shows a PCR
analysis (PCR1) of E- and O-expressing F3 lines from the second round of
backcross of each mutant to wild-type E cells. pO and pE, control PCRs on cloned
MAC versions of mtA. All clones retained the mtA promoter. The wild-type
(WT) or mutant (m) genotypes were confirmed by sequencing. Middle and
bottom panels show northern blots of total RNA samples from sexually reactive
cultures of each F3 line, hybridized with mtA or mtAL1 probes. b, Northern blot
analysis of mtA expression in sexually reactive cultures of E-determined mtB°
and mtC° clones transformed with plasmids pmtB51-NGFP and pmtC51,
respectively (see Extended Data Fig. 3). Transformed clones expressed mating
type E, whereas uninjected controls (C2 and C1) expressed mating type O.
c, Fluorescence microscopy of an mtB° cell transformed with pmtB51-NGFP,
expressing mating type E (GFP and DAPI filters). Scale bar, 20 μm.

To determine whether this memory is attributable to retention of the
mtA promoter, we crossed each of the mutants to the wild type. Autogamy
of F1 heterozygotes of mating-type E yielded homozygous F2 progeny with
a 1:1 ratio of E- and O-expressing clones, reflecting the Mendelian segregation
of wild-type and mutant alleles. Both types of clones retained the mtA
promoter (Extended Data Fig. 2e). That the O-expressing mutant clones
could still transmit E determination was verified by crossing them again to
wild-type E cells: the second-round F1 heterozygotes derived from both
parents always expressed E and retained the mtA promoter (Extended Data
Fig. 2f). Autogamy of these F1 heterozygotes again resulted in the expected
Mendelian segregation of E- and O-expressing clones among second-round
F2 homozygotes (Supplementary Table 2). Thus, independently
of the mating type being expressed, maternal inheritance of mtA promoter
retention may indeed underlie the epigenetic memory of E determination.
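
The expected 1:1 segregation, with promoter retention unaffected by genotype, can be illustrated by a small simulation. This Python sketch rests on simplifying assumptions that are not taken from the paper's methods: autogamy is modelled as picking one of the two parental alleles at random and making it homozygous, every clone in an E cytoplasmic lineage retains the promoter, and the allele symbols and function names are hypothetical.

```python
import random

def autogamy(genotype, rng):
    """Return an entirely homozygous genotype from a diploid parent."""
    allele = rng.choice(genotype)
    return (allele, allele)

def simulate_f2(n_clones, seed=0):
    """Count expressed mating types among F2 clones of an E-determined F1."""
    rng = random.Random(seed)
    counts = {"E": 0, "O": 0}
    retained = 0
    for _ in range(n_clones):
        genotype = autogamy(("+", "m"), rng)   # F1 heterozygote: wild type / mutant
        promoter_retained = True               # maternally inherited, genotype-independent
        retained += promoter_retained
        expresses_e = promoter_retained and genotype == ("+", "+")
        counts["E" if expresses_e else "O"] += 1
    return counts, retained

counts, retained = simulate_f2(10_000)
print(counts, "clones retaining the promoter:", retained)
```

In this model the O-expressing clones differ from the E-expressing ones only in genotype, not in the rearrangement state of the promoter, which is the uncoupling the crosses were designed to reveal.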

To understand how mutant clones can express mating type O despite 
retaining the mtA promoter, we tested mtA expression in E- and O-expressing
lines, using F3 populations obtained by an additional autogamy
of second-round F2 homozygotes. In the case of the mtA° mutation,
northern blots showed that mtA (GSPATG00017533001) transcripts were
produced in sexually reactive cultures of O-expressing lines (Fig. 3a), prompting
us to re-sequence the gene in mutant clones. A substitution was
found to change Arg codon 751 to a stop codon, and the mutation co-segregated
with mating-type O expression among second-round F2 homozygotes
(Supplementary Table 2a). This revealed that GSPATG00017533001
is mtA and confirmed that the encoded protein is required for expression 
of mating type E. The mutation lies at a distance from the mtA promoter 
and apparently does not affect the regulation of its excision, explain- 
ing how O-expressing mutant homozygotes can transmit either O or E 
determination. 

In contrast to mtA° lines, mtB° and mtC° O-expressing lines did not
produce mtA mRNA on sexual reactivity (Fig. 3a). The mutations were 
identified by whole-genome sequencing, and found to be substitutions 
in GSPATG00026812001 and GSPATG00009074001, respectively (Sup- 
plementary Data 1b, c). Correct identification of the mutations was 
confirmed by their co-segregation with O expression among F; lines (Sup- 
plementary Table 2c, d), and by transformation of the MACs of E- 
determined, O-expressing mutants with the wild-type alleles, which resulted 
in mtA transcription and E expression in both cases (Fig. 3b and Extended
Data Fig. 3). Thus, both mtB and mtC seem to be transcription factors 
required for mtA expression in E-determined cells. The mtB and mtC 
genes are constitutively expressed at low levels, and both of the encoded 
proteins are predicted to be nuclear. mtC is a 138-amino-acid protein 
containing a C2H2 zinc finger, the structure of which is probably affec- 
ted by the mtC° mutation (Supplementary Data 2). Although the 310- 
amino-acid mtB protein does not contain any recognizable domain, an 
mtB-GFP fusion protein was found to localize to the MAC (Fig. 3c). Deep 
sequencing of the transcriptomes of mtB° and mtC° mutants further 
showed that mtA is the only gene that requires both factors for express- 
ion during sexual reactivity; importantly, it is the only gene found to be 
downregulated in the mtB° mutant (Supplementary Table 3). 
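
The relationship between determination and expression described so far can be summarized as a small truth table. The Python sketch below is an interpretive summary under stated assumptions, not a model from the paper: the class and field names are hypothetical, and each gene is reduced to a functional or non-functional flag.

```python
from dataclasses import dataclass

@dataclass
class Clone:
    promoter_retained: bool        # MAC rearrangement state (maternally inherited)
    mtA_functional: bool = True    # coding sequence intact (False for the mtA° allele)
    mtB_functional: bool = True
    mtC_functional: bool = True

def mtA_transcribed(clone):
    """mtA mRNA requires the promoter plus both the mtB and mtC factors."""
    return clone.promoter_retained and clone.mtB_functional and clone.mtC_functional

def expressed_type(clone):
    """Phenotype during sexual reactivity; O is the default."""
    return "E" if mtA_transcribed(clone) and clone.mtA_functional else "O"

def determined_type(clone):
    """Type transmitted to sexual progeny, set by promoter retention alone."""
    return "E" if clone.promoter_retained else "O"

examples = {
    "wild-type E": Clone(True),
    "wild-type O": Clone(False),
    "mtA mutant, E lineage": Clone(True, mtA_functional=False),
    "mtB mutant, E lineage": Clone(True, mtB_functional=False),
    "mtC mutant, E lineage": Clone(True, mtC_functional=False),
}
for name, clone in examples.items():
    print(name, mtA_transcribed(clone), expressed_type(clone), determined_type(clone))
```

The three mutants in an E lineage all express O while remaining determined for E; only the mtA mutant still produces the transcript, matching the northern blot results.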


mtA promoter excision is regulated by scnRNAs 

Excision of the mtA promoter in mating-type O cells occurs between two 
5'-TA-3’ dinucleotides within a reasonable IES end consensus (Supple- 
mentary Data 1a), suggesting that the general IES excision machinery is 
involved. RNAi-mediated depletion of the Pgm endonuclease during
autogamy of O cells indeed impaired excision and resulted in accumulation
of unexcised copies in the developing new MAC (Fig. 4a). So did
depletion of proteins known to be involved in scnRNA biogenesis or
action, including the Dicer-like Dcl2 and Dcl3 (ref. 12), the Piwi-like
Ptiwi01 and Ptiwi09 (ref. 11), and the Nowa1 and Nowa2 RNA-binding
proteins, indicating that the scnRNA pathway is required to target excision
in O cells. Depletion of each pair of proteins causes massive retention
of IESs genome-wide, resulting in non-functional new MACs and
post-autogamous lethality. In conditions of partial depletion, obtained 
by silencing only one of the two Piwi (or the two Dicer-like) genes, the 
progeny is usually viable. Analysis of individual clones showed that they 
frequently retained the mtA promoter in the new MAC, and in all cases 
this correlated with a switch to mating type E (Fig. 4b). 

No other IES is known to be affected in these conditions; in this regard, 
excision of the mtA promoter behaves like MAC deletions of cellular 
genes, which were shown to be more sensitive to partial impairment of 
the scnRNA pathway. Whereas other IESs are intervening sequences
that must be excised to reconstitute functional genes, the 195-bp segment
is a functional part of the mtA gene. The homologous segment is
not excised in mtA orthologues from other P. aurelia species (see below),
indicating that excision of a bona fide part of the MAC genome evolved
as a derived character in P. tetraurelia, as random mutations happened
to create sites suitable for Pgm-mediated excision.

Figure 4 | Genes required for excision of the mtA promoter in O cells.
a, PCR analysis (PCR5) of mtA promoter retention in mass progenies of O
clones after RNAi-mediated silencing of the indicated genes. Total DNA
samples were prepared from starved post-autogamous cells when the new
MACs were clearly visible. Because the parental MAC is still present at this
stage, the promoter-excised version is amplified in all cases; the promoter-retaining
fragment can be detected only if it accumulates in zygotic MACs.
ND7, unrelated-gene negative control; D2, DCL2; D3, DCL3; C1, no-silencing
negative control; C2, empty-vector RNAi control; P1, PTIWI01; P9, PTIWI09;
CE, control PCR5 on E cells. b, PCR analysis (PCR5) and mating-type tests
(below) of individual viable post-autogamous clones from non-lethal silencing
conditions (including partial silencing of NOWA1).

Deep sequencing of small RNAs from an early conjugation time point 
confirmed that scnRNAs are produced from both strands of the mtA pro- 
moter, as they are from the rest of the germline genome (Extended Data 
Fig. 4). On average, scnRNA coverage was ~2-fold higher for IESs than 
for other genome features, as independently found in another study.
Although we cannot exclude that this results from precocious degradation
of MAC-matching scnRNAs, an intrinsically more abundant production
of scnRNAs from IESs could explain why IES excision is less
sensitive to partial impairment of the scnRNA pathway than gene deletion.
Northern blot analyses of autogamy time courses showed that mtA-promoter
scnRNAs are produced in similar amounts during meiosis of
O and E cells, and failed to detect any difference in the timing of their
disappearance (Extended Data Fig. 5). Thus, mtA-promoter scnRNAs may
be inactivated in E cells by some other mechanism than the active degradation
proposed in Tetrahymena thermophila, for instance by sequestration
in the maternal MAC. Whatever the precise mechanism, one
prediction of the model is that RNAi-mediated destruction of maternal
transcripts of the mtA promoter during autogamy of E cells will prevent
inactivation of homologous scnRNAs, licensing them to target excision
in the zygotic MAC. Indeed, feeding E cells with double-stranded RNA
homologous to the mtA promoter before autogamy induced its precise 
excision in the new MAC and resulted in O progeny (Supplementary 
Table 4). The induced type O was thereafter inherited for at least three 
sexual generations (not shown). 


A different switch in P. septaurelia 


P. tetraurelia belongs to a group of 15 sibling species that are morphologically
indistinguishable but sexually incompatible. All have homologous
O and E mating types, as shown by cross-agglutination between
the O types of some species and the E types of others. However, maternal
inheritance of mating types is observed in only half of these species. In
others, mating types are randomly determined during MAC development,
without any influence of the maternal MAC. The distribution of
these systems in the phylogenetic tree of aurelia species suggests multiple
changes during evolution of the complex (Extended Data Fig. 6). We
first sequenced mtA, mtB and mtC orthologues in two strains of P. octaurelia,
a maternal-inheritance species closely related to P. tetraurelia, with
which it can form viable (though sterile) F1 hybrids. We found the mtA
promoter to be excised in O clones, with the same boundaries as in 
P. tetraurelia, but not in E clones (Extended Data Fig. 7 and Supple- 
mentary Data 1a). This excision system thus probably evolved in the 
common ancestor of the two species. 

We then examined P. septaurelia, another maternal-inheritance species
that groups with random-determination species. Sequencing of the
mtA, mtB and mtC orthologues in the wild-type strain 227 showed that
the mtA227 promoter was not excised in the MACs of O cells (Supplementary
Data 1a). Instead, MAC copies of the mtB227 gene contained
either of two alternative deletions of coding-sequence segments between
IES-like boundaries (Fig. 5a and Supplementary Data 1b), suggesting that
the gene was rearranged into non-functional forms. Our stock of strain
227 contained only O cells, but complementation of their MACs by microinjection
of the P. tetraurelia pmtB51 transgene made these cells phenotypically
E and resulted in mtA transcription during sexual reactivity (Fig. 5b).
Thus, mating type E is characterized by mtB-dependent mtA expression 
in P. septaurelia, as in P. tetraurelia. 

The mtB gene belongs to a family of up to six paralogues in P. aurelia
species, and sequence conservation indicates that all have been under purifying
selection since before speciation (not shown). The excision events
that inactivate mtB227 in O cells are not observed in any other paralogue
or in mtB orthologues from other species, where the 5' TAs correspond
to different nucleotides. As with mtA promoter excision in P. tetraurelia,
this excision system apparently does not originate from any transposable
element insertion but seems to have arisen in P. septaurelia after the
chance appearance of good matches to the IES end consensus, which seems
to be conserved among P. aurelia species.

Figure 5 | mtB rearrangements determine mating types in P. septaurelia.
a, MIC and MAC versions of mtB227. The MIC sequence contains two IESs
(black boxes), conserved in P. tetraurelia strain 51. In the MAC of O cells of
strain 227, the mtB227 coding sequence suffers two alternative deletions, each
between TA dinucleotides within IES-end consensus sequences (TAYAG, see
Supplementary Data 1b). b, Northern blot analysis of mtA expression in a
sexually reactive, pmtB51-transformed O clone of strain 227 (T). C, control
uninjected O clone. Tested mating types are indicated. c, Transformation of
P. tetraurelia clones of mating type O with the P. septaurelia mtA227 gene. A
duplex PCR was used to assess transgene copy numbers through the relative
abundance of a 176-bp transgene-specific product (pmtA227, PCR12),
compared to the 266-bp product from the promoterless endogenous mtA
(mtA51, PCR1; undetectable in high-copy transformants). C1 and C2,
uninjected control clones. Each clone was tested with P. tetraurelia (51) and
P. septaurelia (227) O and E testers. Number of plus signs represents strength
of mating reaction; minus indicates no agglutination.

Taking advantage of the evidence that the mtB51 protein from
P. tetraurelia can activate the mtA227 promoter in P. septaurelia, we microinjected
the mtA227 gene into the MACs of P. tetraurelia cells of mating
type O, and tested the sexual preference of transformed clones with O and
E testers of both species (Fig. 5c). Consistent with the reported lack
of cross-agglutination between the complementary types of these two
species, uninjected control clones reacted only with P. tetraurelia E testers.
In contrast, transformed clones agglutinated only with P. septaurelia
O testers, allowing interspecific conjugation to occur. Thus, mtA227 is
sufficient for expression of the P. septaurelia E specificity, incidentally
confirming that mtA localizes on the external side of ciliary membranes.
The lack of reaction of transformed clones with P. tetraurelia O testers
indicates that mtA interacts with an unknown O-specific receptor which
also differs between species. Furthermore, transformed clones did not
react with P. tetraurelia E testers, indicating that mtA227 expression, like
mtA51 expression (Extended Data Fig. 1), functionally masks expression
of the endogenous O-specific receptor. This would explain the default 
nature of the O type. 

Strain 38 is a natural P. septaurelia isolate carrying a Mendelian mutation
at a single locus (mt) which, in contrast to the P. tetraurelia mutations,
affects both the expression and the epigenetic inheritance of mating
types. This dual effect suggests that mt is the gene that controls mating
types through an alternative rearrangement in that species (see Extended
Data Fig. 8), providing an independent means to confirm the identification
of the P. septaurelia switch. Sequencing of the mtB38 allele revealed
mutations that can explain both effects (Extended Data Fig. 8c and Supplementary
Data 1b), and the analysis of a cross between strains 38 and 227
showed that mt is identical or closely linked to mtB (Supplementary Table 5).
To confirm directly that mtB rearrangements are regulated by homology-dependent
maternal effects, the full-length, functional MAC form
of the mtB227 allele was microinjected into the MACs of O cells of strain
227. Transformed clones expressed E and produced E post-autogamous 
progeny (Extended Data Fig. 9). We conclude that mating types are determ- 
ined in P. septaurelia by maternally inherited alternative rearrange- 
ments of the mtB gene, a mechanism that must have evolved independently 
from that of P. tetraurelia. 


Discussion 


Seventy-five years after mating types were first described in P. tetraurelia, 
the expression of a single gene, mtA, was found to make the difference 
between the two types. mtA transcription in E cells was further shown to 
require the mtB and mtC gene products, and this E-specific pathway seems 
to be conserved in other P. aurelia species. Our results indicate that mtA is 
a ciliary transmembrane protein directly involved in the species-specific 
recognition of O cells. Although the nature of the O-specific receptor 
remains unknown, its expression or function appears to be masked by 
the mtA protein, given the default O phenotype observed in mutants. 
Expressing the P. septaurelia mtA orthologue in O cells of P. tetraurelia 
not only provided them with the P. septaurelia E specificity, but also 
blocked expression of the P. tetraurelia O specificity. In the related 
P. caudatum, an antigen involved in agglutination was shown to be pro- 
duced in the cytoplasm of both O and E reactive cells, but to localize in the 
ciliary membrane only in O cells. In E cells of P. aurelia species, the mtA
protein could inhibit the synthesis, processing or transport of the O-specific
receptor. mtA is structurally similar to the proteins recently shown
to be specific for each of the seven mating types of T. thermophila, but
how these function in mating-type recognition remains to be determined. 

Despite conservation of the mtA on/off switch as the E/O expression 
determinant, the mechanisms used to produce O clones differ among 
P. aurelia species. Excision of the mtA promoter is shared between the 
closely related P. tetraurelia and P. octaurelia, but in P. septaurelia mtA 
expression is prevented by deletions in mtB (Extended Data Fig. 10). This 
seems to be functionally equivalent, as mtB is highly specific for the mtA 
promoter in P. tetraurelia. The P. tetraurelia rearrangement involves the 
IES excision machinery and is regulated by the scnRNA pathway, explaining
the maternal inheritance of mating types in this species, and
probably in the other two as well. Ciliate scnRNAs resemble metazoan
piRNAs in many respects, including their central roles in epigenetic
reprogramming events that are critical for sexual reproduction, and
both have been described as genomic ‘immune systems’ primarily
involved in the control of transposable elements. One difference,
however, is that metazoan piRNA systems keep a memory of ‘non-self’ sequences,
based on integration into specific small-RNA-producing loci.
The Paramecium system seems to work the opposite way, keeping a
memory of ‘self’ sequences in the form of the maternal MAC, which is
probed by genome-wide scnRNAs at each generation to identify ‘non-self’
molecules by subtraction.

As a consequence, the Paramecium genome-defence mechanism can 
easily be recruited for heritable regulation of bona fide cellular genes, an 
important question that has been raised in diverse eukaryotes. Indeed,
the sequences being excised from the mtA and mtB genes differ
from other IESs in that they are functional parts of cellular genes and
apparently do not derive from the insertion of transposable elements;
moreover, the mtA rearrangement behaves like deletions of cellular genes
in its mechanistic requirements, which show subtle differences with IES
excision. The independent evolution of the P. tetraurelia and P. septaurelia
switches suggests that exaptation of the scnRNA pathway is a
general mechanism for transgenerational epigenetic inheritance of differentiated
states. In plants and mammals, epigenetic regulation of cellular
genes often relies on transposon-induced epigenetic changes, as
is the case of sex determination in melon. The capacity of the Paramecium
mechanism to inactivate any single-copy gene makes it even more
flexible, dispensing with the need for transposon insertion. In keeping
with the low information content of the IES end consensus, accidental
rearrangement errors are not rare genome-wide, and may occasionally
provide a selective advantage in specific conditions; maternal
transmission to sexual progeny would thus allow continuous adaptation
of the somatic (MAC) genome, independently of any Mendelian (MIC) 
mutation. 


METHODS SUMMARY 


Unless otherwise stated, all experiments were carried out with the entirely homo- 
zygous strain 51 of P. tetraurelia; the origins of other strains and species are given in 
the Methods section. Molecular biology experiments used standard procedures. Para- 
mecium-specific methods (RNAi-mediated gene silencing by dsRNA feeding, DNA 
microinjection into the MAC) have been published and are referenced in the Methods 
section. For mating-type tests, testers were prepared from cell lines of known mating 
types by re-feeding ~1,000 autogamous cells in tubes for 2-3 divisions. The proce- 
dures used to test mass post-autogamous progenies or individual clones are presented 
in the Methods section. Bioinformatic analyses and statistical testing (microarray ex- 
pression data, whole-genome transcriptome profiling by RNA-seq) are described in 
the Methods. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS


Paramecium strains and cultivation. Unless otherwise stated, all experiments 
were carried out with the entirely homozygous strain 51 of P. tetraurelia. The mutant 
strains mtA°, mtB° and mtC°, as well as strain 32 of P. tetraurelia, were from the stock 
collection of the Centre de Génétique Moléculaire in Gif-sur-Yvette, France. The mtA°
mutant was independently ordered from ATCC (ATCC number 30762). Strains 138 
and GFg-1 of P. octaurelia were from the stock collection of the Institute of Systematics 
and Evolution of Animals, Krakow, Poland. Strain 227 from P. septaurelia was from 
the stock collection of the Centre of Core Facilities ‘Cultivation of Microorganisms’ in 
St Petersburg State University, Russia. Strain 38 of P. septaurelia was ordered from 
ATCC (ATCC number 30575). Cultivation and autogamy were carried out at 27 °C 
(unless otherwise stated) as described (refs 50, 51). 

DNA and RNA extraction, Southern and northern blots. DNA and RNA samples 
were typically extracted from 200- to 400-ml cultures of exponentially growing cells 
at <1,000 cells ml⁻¹ or of autogamous cells at 2,000-4,000 cells ml⁻¹ as previously 
described’. Small-scale DNA samples were prepared from =1,000 cells using the 
NucleoSpin Tissue kit (Macherey-Nagel). The TRIzol (Invitrogen) RNA extraction 
procedure was adapted for small-scale cultures (~20 ml) of individual transformed 
clones during sexual reactivity. Electrophoresis and blotting were carried out according
to standard procedures. For small-RNA northern blots, small RNAs were enriched
by PEG8000 precipitation, and the equivalent of 75 µg of total RNA for each
time point was run on a 15% polyacrylamide-urea gel, transferred to Hybond NX
membranes, and chemically crosslinked (ref. 52).

Microarray expression data. Expression data were obtained from single-channel
NimbleGen microarrays covering all 39,642 annotated genes, with six different 50-mer
probes per gene. Raw signals were processed using the standard RMA method (ref. 53). This
includes a first step of background subtraction for each array, followed by between-array
normalization, which was carried out using the normalizeBetweenArrays function
from the limma package (ref. 54). The latter step adjusts signals so that expression values
have similar distributions in the two arrays considered in the analysis. The expression
level of each gene was taken as the median signal from the six probes. The microarray
platform has been described in more detail elsewhere (ref. 55).
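
The two numerical steps described above (between-array normalization and the per-gene median over six probes) can be sketched as follows. This is only an illustration in plain Python, not the authors' pipeline: it assumes quantile normalization as the between-array method (the text names the limma function but not the method chosen), it omits the RMA background subtraction, and the function names, the toy values and the layout of probes in consecutive blocks of six are all hypothetical.

```python
from statistics import median


def quantile_normalize(arrays):
    """Give every array the same signal distribution (quantile normalization).

    `arrays` is a list of equal-length lists, one list of probe signals per array.
    Assumption: this stands in for the between-array step; ties are broken by
    probe order rather than averaged.
    """
    n_probes = len(arrays[0])
    sorted_signals = [sorted(a) for a in arrays]
    # Reference distribution: mean of the k-th smallest signal across arrays.
    reference = [
        sum(s[k] for s in sorted_signals) / len(arrays) for k in range(n_probes)
    ]
    normalized = []
    for signals in arrays:
        order = sorted(range(n_probes), key=lambda i: signals[i])
        out = [0.0] * n_probes
        for rank, probe_index in enumerate(order):
            out[probe_index] = reference[rank]
        normalized.append(out)
    return normalized


def gene_levels(signals, probes_per_gene=6):
    """Median signal per gene, assuming probes are stored in consecutive blocks."""
    return [
        median(signals[i:i + probes_per_gene])
        for i in range(0, len(signals), probes_per_gene)
    ]


# Toy data: two arrays, two genes, six probes each (values are made up).
array_a = [5.1, 5.3, 4.9, 5.0, 5.2, 5.4, 9.0, 9.2, 8.8, 9.1, 8.9, 9.3]
array_b = [6.1, 6.3, 5.9, 6.0, 6.2, 6.4, 7.0, 7.2, 6.8, 7.1, 6.9, 7.3]
norm_a, norm_b = quantile_normalize([array_a, array_b])
levels_a = gene_levels(norm_a)
levels_b = gene_levels(norm_b)
```

After normalization the two arrays contain exactly the same set of values, so gene-level medians can be compared between them directly.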

Alignment and prediction programs. DNA and protein sequences were aligned
using the MUSCLE software on the phylogeny.fr website (ref. 56). Prediction of protein
localization and transmembrane protein topology used the PSORT II (ref. 57) and
PolyPhobius (ref. 58) servers.

Mating-type tests. Testers were prepared from cell lines of known mating types by
re-feeding ~1,000 autogamous cells in tubes with 4 ml of 0.2× WGP medium bacterized
with Klebsiella pneumoniae (light medium) and incubating overnight at 27 °C.
The next day, tubes were re-fed with 8 ml of light medium and again incubated overnight
at 27 °C. The following day, reactive cells concentrated near the top of the tube
were collected (~1.5 ml per tube), checked by mixing aliquots with reactive cells of
the complementary type, and used in mating-type tests. Mass post-autogamous progenies
to be tested were made reactive in the same way. To test individual clones,
single karyonides (out of autogamy or conjugation) were isolated in 250 µl of light
medium and incubated until starved. They were then re-fed with 250 µl and tested
the next day.

RNAi-mediated gene silencing by dsRNA feeding and microinjections. Procedures 
for RNAi and DNA microinjection into the MAC, as well as inserts used to silence 
scnRNA-pathway genes and PGM and the procedure to silence genes during auto- 
gamy, have been described*'"’**”?. To silence genes during sexual reactivity, cells 
were made reactive as described under ‘Mating-type tests’, except that the second re- 
feeding with 8 ml was done with dsRNA feeding medium instead of normal WGP 
medium. 

Plasmid constructs and probes. The plasmids used in this study are described in 
the relevant figures or Extended Data. The mtA-GFP fusion transgene used in Fig. 2d-f 
is a modified version of pmtA-ES (Extended Data Fig. 1) in which the EGFP coding 
sequence, flanked by Gly-Ser-Gly-Gly and Gly-Gly linkers, was inserted in place of 
mtA residue Asn 1002. The pmtA²²⁷ transgene used in Fig. 5c is the P. septaurelia 
equivalent of pmtA-ES, with complete flanking intergenic regions (without the down- 
stream unannotated gene). Complete sequences of all plasmids are available on request. 
The mtA probe used to hybridize northern blots covered the whole coding sequence, 
except on the left panel of Fig. 1c where it spanned positions 415-1010 of the corrected 
gene model (PTETG5300016001, see ParameciumDB). The P. septaurelia probe in 
Fig. 5b and Extended Data Fig. 8d covered region 78-2245 of the mtA** gene model. 
For other genes, probes covered the indicated regions of each gene model: mtAL1, 
3053-3599 of GSPATG00025159001; mtAL2, 3328-3790 of GSPATG00002922001; 
PTIWI09, 50-439 of GSPATG00020796001; NOWA2, 2513-3330 of GSPATG00016668001;
PGM, 1540-2390 of GSPATG00016627001; PTIWI10, 4-264 of GSPATG00009468001.
Oligonucleotide probes for scaffold-22 endogenous siRNAs (Extended 
Data Fig. 5c) have been described. 

Confocal analysis of mtA-GFP localization. Cells were fixed in 2% paraformaldehyde
in PBS buffer and rinsed in the same buffer supplemented with 3% BSA. All subsequent
steps were performed in this buffer: cells were incubated with a commercial
anti-GFP rabbit serum (Molecular Probes A6455, Invitrogen) at a 1/1,000 dilution,
rinsed twice, incubated with an Alexa 568-coupled anti-rabbit antibody (Molecular
Probes goat anti-rabbit IgG(H+L) A11011, Invitrogen), rinsed in 1 µg ml⁻¹ Hoechst
33258, mounted in Citifluor (Citifluor Ltd, London), and observed under a Leica
Confocal SP8 microscope with 405, 488 and 552 laser line excitations for blue, GFP and
DIC, and red detection, respectively. Images were merged using ImageJ and Adobe
Photoshop.

Deep sequencing of scnRNAs. Small-RNA libraries were previously constructed 
using 24-nt adaptors’*. The corresponding PCR products were extended with 6 
degenerate nucleotides at each end and ligated to Illumina adaptors essentially 
following Illumina’s recommendations (TruSeq DNA sample prep kit protocol) 
before sequencing on the GAIIx system (100-nt single reads) (Illumina, USA). 

Whole-genome transcriptome profiling. Eight Illumina sequencing libraries were
constructed (two biological replicates for mtB°, two for mtC°, and two wild-type controls
for each) using the Illumina TruSeq RNA Sample Preparation kit. Approximately
30 million 101-nt read pairs were obtained for each library with the Illumina
HiSeq 1000 sequencing system. Reads were processed to remove adapters and trimmed
for quality before mapping to the strain 51 genome (ref. 62) using BWA software (ref. 63).
The fragments that mapped to each annotated gene were counted only if both reads in the
pair mapped to the same gene and only if the mapping for both reads was unique in
the genome. Fragments were only counted if neither read in the pair had more than
two mismatches. Fragment counts per gene were obtained from the BWA output
using Samtools (ref. 64) and a custom Perl script. Surface antigen genes were excluded from
the analysis as the cell cultures were not controlled for serotype and surface antigen
mRNA can account for 3% of total cellular mRNA. Differential gene expression was
determined from fragment counts using the DESeq package (ref. 65). P values were corrected
for multiple testing using the Benjamini-Hochberg procedure.
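
The counting rules and the multiple-testing correction described above can be sketched as follows. This is a hedged illustration in Python, not the custom Perl script used in the study: the `ReadHit` record, the function names and the toy data are hypothetical, and the per-read fields (gene, uniqueness, mismatch count) are assumed to have been extracted from the alignments beforehand.

```python
from collections import Counter
from typing import NamedTuple, Optional


class ReadHit(NamedTuple):
    """Hypothetical summary of one mapped read (not a BWA or Samtools type)."""
    gene: Optional[str]   # annotated gene the read maps to, or None
    unique: bool          # True if the read maps to a single genome location
    mismatches: int       # number of mismatches in the alignment


def count_fragments(pairs, excluded_genes=frozenset(), max_mismatches=2):
    """Count read pairs per gene using the filters stated in the text."""
    counts = Counter()
    for read1, read2 in pairs:
        if read1.gene is None or read1.gene != read2.gene:
            continue  # both reads must map to the same annotated gene
        if read1.gene in excluded_genes:
            continue  # e.g. surface antigen genes
        if not (read1.unique and read2.unique):
            continue  # both mappings must be unique in the genome
        if max(read1.mismatches, read2.mismatches) > max_mismatches:
            continue  # neither read may exceed the mismatch limit
        counts[read1.gene] += 1
    return counts


def benjamini_hochberg(p_values):
    """Benjamini-Hochberg adjusted P values, returned in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        index = order[rank - 1]
        running_min = min(running_min, p_values[index] * m / rank)
        adjusted[index] = running_min
    return adjusted


# Toy data (made up): only the first pair passes every filter.
pairs = [
    (ReadHit("geneA", True, 0), ReadHit("geneA", True, 2)),
    (ReadHit("geneA", True, 3), ReadHit("geneA", True, 0)),
    (ReadHit("geneA", True, 0), ReadHit("geneB", True, 0)),
    (ReadHit("geneB", False, 0), ReadHit("geneB", True, 0)),
    (ReadHit("antigen", True, 0), ReadHit("antigen", True, 0)),
]
counts = count_fragments(pairs, excluded_genes={"antigen"})
adjusted = benjamini_hochberg([0.01, 0.04, 0.03, 0.005])
```

In the study itself, differential expression was computed with DESeq; the correction function here only illustrates the final adjustment step.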


50. Beisson, J. et al. Mass culture of Paramecium tetraurelia. Cold Spring Harb. Protoc. 
2010, http://dx.doi.org/10.1101/pdb.prot5362 (2010). 

51. Beisson, J. et al. Maintaining clonal Paramecium tetraurelia cell lines of controlled 
age through daily reisolation. Cold Spring Harb. Protoc. 2010, 
http://dx.doi.org/10.1101/pdb.prot5361 (2010). 

52. Pall, G. S. & Hamilton, A. J. Improved northern blot method for enhanced detection 
of small RNA. Nature Protocols 3, 1077-1084 (2008). 

53. Irizarry, R. A. et al. Exploration, normalization, and summaries of high density 
oligonucleotide array probe level data. Biostatistics 4, 249-264 (2003). 

54. Smyth, G. K. & Speed, T. Normalization of cDNA microarray data. Methods 31, 
265-273 (2003). 

55. Arnaiz, O. et al. Gene expression in a paleopolyploid: a transcriptome resource for 
the ciliate Paramecium tetraurelia. BMC Genom. 11, 547 (2010). 

56. Dereeper, A. et al. Phylogeny.fr: robust phylogenetic analysis for the non-specialist. 
Nucleic Acids Res. 36, W465-W469 (2008). 

57. Nakai, K. & Horton, P. PSORT: a program for detecting sorting signals in proteins 
and predicting their subcellular localization. Trends Biochem. Sci. 24, 34-35 
(1999). 

58. Käll, L., Krogh, A. & Sonnhammer, E. L. An HMM posterior decoder for sequence 
feature prediction that includes homology information. Bioinformatics 21 
(suppl. 1), 251-257 (2005). 

59. Beisson, J. et al. DNA microinjection into the macronucleus of Paramecium. Cold 
Spring Harb. Protoc. 2010, http://dx.doi.org/10.1101/pdb.prot5364 (2010). 

60. Beisson, J. et al. Silencing specific Paramecium tetraurelia genes by feeding 
double-stranded RNA. Cold Spring Harb. Protoc. 2010, 
http://dx.doi.org/10.1101/pdb.prot5363 (2010). 

61. Marker, S., Le Mouel, A., Meyer, E. & Simon, M. Distinct RNA-dependent RNA 
polymerases are required for RNAi triggered by double-stranded RNA versus 
truncated transgenes in Paramecium tetraurelia. Nucleic Acids Res. 38, 4092-4107 
(2010). 

62. Aury, J. M. et al. Global trends of whole-genome duplications revealed by the ciliate 
Paramecium tetraurelia. Nature 444, 171-178 (2006). 

63. Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler 
transform. Bioinformatics 25, 1754-1760 (2009). 

64. Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 
2078-2079 (2009). 

65. Anders, S. & Huber, W. Differential expression analysis for sequence count data. 
Genome Biol. 11, R106 (2010). 

66. Brygoo, Y., Sonneborn, T. M., Keller, A. M., Dippell, R. V. & Schneller, M. V. Genetic 
analysis of mating type differentiation in Paramecium tetraurelia. II. Role of the 
micronuclei in mating-type determination. Genetics 94, 951-959 (1980). 

67. Brygoo, Y. Genetic analysis of mating-type differentiation in Paramecium 
tetraurelia. Genetics 87, 633-653 (1977). 

68. Kapusta, A. et al. Highly precise and developmentally programmed genome 
assembly in Paramecium requires ligase IV-dependent end joining. PLoS Genet. 7, 
e1002049 (2011). 


©2014 Macmillan Publishers Limited. All rights reserved 




[Extended Data Figure 1 image: a, plasmid maps (pmtA-E, pmtA-ES, pmtA-IES, pIES) with the positions of PCR1-PCR4; b-d, PCR gels with the mating types of injected clones and of their progeny.]


Extended Data Figure 1 | The 195-bp segment contains the mtA promoter 
and ensures maternal inheritance of its own retention. a, Plasmids used for 
microinjection into the MAC of O cells. The thick grey lines on either side of 
inserts represent vector sequences. Plasmid pmtA-E contains the E MAC 
version with entire intergenic regions; pmtA-ES lacks the downstream 
unannotated gene. pmtA-IES lacks intergenic sequences upstream of the 
195-bp segment. pIES contains only the 195-bp segment. The PCRs used for 
testing transformation are shown. b, Transformation of O cells with pmtA-E or 
pmtA-ES. PCR1 amplifies the promoter-containing form from either plasmid 
and the promoter-excised form from the endogenous mtA gene; relative 
amounts provide an indication of plasmid copy numbers. The mating types of 
injected clones and of their mass post-autogamous progeny are indicated. C1 
and C2, uninjected controls; pE, control PCR on pmtA-E. c, Transformation of 
O cells with pmtA-IES. The plasmid-specific PCR2 (1,533 bp) identifies 
transformed clones. The O type of clones 9 and 10 could be due to transgene 
silencing. d, Transformation of O cells with pIES. Injected clones were tested 
with a duplex PCR: PCR4 amplifies a 450-bp product from the plasmid and 
PCR3 a 1,035-bp product from the endogenous mtA gene, hardly detectable in 
high-copy transformants. C1 and C2, uninjected controls; pD, control duplex 
PCR on an equimolar mix of pIES and pmtA-E. PCR1 revealed that the 195-bp 
segment was retained in the mass progeny of transformed clones; the selfer 
phenotype (S) of the mass progeny from clone 4 probably reflects heterogeneity 
among individual post-autogamous clones. 

Extended Data Figure 2 | Genetic analysis of mutations affecting mating-type
expression. a, Mendelian segregation of a pair of alleles. In both 
conjugation and autogamy, the two MICs undergo meiosis, but only one 
haploid product is retained in each cell; an additional mitosis then produces two 
identical gametic nuclei. During conjugation of genetically different cells (m/m 
and +/+), reciprocal exchange of one gametic nucleus, followed by karyogamy, 
therefore results in genetically identical zygotic nuclei in the two exconjugants. 
The drawing shows the heterozygous MICs and MACs that develop from 
copies of the zygotic nuclei in each of the F1 cells. During autogamy, the two 
identical gametic nuclei fuse together, resulting in entirely homozygous 
zygotes; post-autogamous F2 progeny of heterozygotes have a 50% probability 
to keep each of the parental alleles. b, Maternal (cytoplasmic) inheritance of 
mating types. Coloured Os and Es around the cells indicate the mating type 
expressed by each vegetative clone; the MACs are coloured to symbolize their 
mating-type determination states (O, blue; E, orange). Because little cytoplasm 
is exchanged during conjugation, the parental MACs independently condition 
zygotic MACs for the same mating types, resulting in cytoplasmic inheritance: 
the F1 derived from the O parent is determined for O in 94% of cases, and the F1 
derived from the E parent is E in 98% of cases®. The frequency of mating-type 
reversal is much lower at autogamy: <1/50,000 in the O-to-E direction, and 
1/3,000 in the E-to-O direction”. c, Cross of O-determined expression mutants 
mtA°, mtB° and mtC° to wild-type E cells. F2 mutant homozygotes formed in
the E cytoplasmic lineage express mating type O as a result of the mutation, but
remain determined for E (orange MAC). d, Cross of O-expressing,
E-determined mutants mtA°, mtB° and mtC° (as produced in c) to wild-type E
cells. E determination of the mutant parent is revealed by E determination and
expression of the derived F1, which has received the wild-type allele. In this
cross all cells are determined for E, and the expressed mating type simply
depends on genotype. e, mtA promoter retention in the cross shown in
c, illustrated here for a cross of the mtA° mutant with wild-type E cells. Top
panel: PCR analysis (PCR5) of two sets of 4 mtA°/mtA+ F1 karyonides
(after fertilization, two new MACs develop in each cell from copies of the
zygotic nucleus, and then segregate to daughter cells, called karyonides, at the
first cellular division; each pair of conjugants thus gives rise to 4 F1 karyonidal
clones). Each set contained two O clones that had excised the mtA promoter, and
probably derived from the mutant O parent, and two E clones that retained it,
and probably derived from the wild-type E parent (mating types are indicated
below each lane). Bottom panels: after autogamy of two mtA°/mtA+ F1
heterozygotes of mating type E (clone 4 in each set), 12 F2 homozygous progeny
were isolated for each, grown and tested for mating-type expression and for
mtA promoter retention using PCR6 (Supplementary Table 6), which amplifies
products of 665 bp and 470 bp from the promoter-containing and promoter-excised
versions, respectively. All clones retained the promoter, although a
fraction of them (14 of 23) expressed mating type O, as expected for mtA°
homozygotes. ND, not determined. f, mtA promoter retention in F1
heterozygotes from the cross shown in d, illustrated by a typical set of 4 mtB°/mtB+
F1 karyonides from the cross of an E-determined, O-expressing mtB°
homozygote (as produced in c) to wild-type E cells. PCR5 showed that the mtA
promoter was now retained in F1 karyonides from both parents; all were of
mating type E.
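
As a worked check of the segregation reported in panel e (14 of 23 F2 clones expressing O, against the 50% expected for each parental allele after autogamy of a heterozygote), an exact two-sided binomial test can be computed as below. This test is an illustration added here and is not reported in the caption; the function name is hypothetical.

```python
from math import comb


def binomial_two_sided_p(successes, trials, p=0.5):
    """Exact two-sided binomial P value: total probability of all outcomes
    that are no more likely than the observed one."""
    def pmf(k):
        return comb(trials, k) * p**k * (1 - p)**(trials - k)

    observed = pmf(successes)
    # Small relative tolerance so that ties are not lost to rounding.
    return sum(
        pmf(k) for k in range(trials + 1) if pmf(k) <= observed * (1 + 1e-9)
    )


# 14 of 23 clones expressed mating type O; expectation under 1:1 segregation is 50%.
p_value = binomial_two_sided_p(14, 23)
```

The resulting P value is about 0.4, so 14 of 23 does not depart detectably from a 1:1 segregation.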

Extended Data Figure 3 | Complementation of the mtB° and mtC°
phenotypes with the wild-type alleles of GSPATG00026812001 and
GSPATG00009074001, respectively. a, Structure of the MIC and MAC
versions of the mtB gene, and of the GFP fusion transgene used for
complementation. The coding sequence (open arrow) is shown with the
complete upstream and downstream intergenic regions. The MIC version
contains two IESs (black boxes). Plasmid pmtB⁵¹-NGFP contains the MAC
version with complete intergenic regions, and the EGFP coding sequence was
fused at the 5’ end of the mtB coding sequence. Thick grey lines on either side
represent plasmid vector sequences. b, PCR analysis and mating types of
E-determined mtB° mutant clones transformed with pmtB⁵¹-NGFP. PCR7
(top panel) amplifies products of 1,148 bp from the plasmid, and of 419 bp from
the endogenous mtB gene (Supplementary Table 6). The relative abundance of
the two products gives an indication of plasmid copy number in each clone. C1
and C2, uninjected control clones. Mating types are indicated below each lane.
Clones containing detectable plasmid amounts expressed mating type E,
indicating that the GFP fusion protein is functional; the selfing phenotype (S) of
clone 11 may be due to some cells having lost the plasmid. PCR1 (bottom panel)
confirmed that all clones retained the mtA promoter. c, Structure of the
MIC and MAC versions of the mtC gene, and of the plasmid used for
complementation. The coding sequence (open arrow) is shown with the
complete upstream and downstream intergenic regions. The MIC version
contains one IES (black box). Plasmid pmtC⁵¹ contains the MAC version with
349 bp and 98 bp of upstream and downstream intergenic sequences,
respectively. The plasmid-specific PCR8 amplifies a 419-bp product
(Supplementary Table 6). d, PCR analysis and mating types of E-determined
mtC° mutant clones transformed with pmtC⁵¹. PCR8 (top panel) shows that all
positive clones expressed mating type E. C1, C2 and C3, uninjected control
clones. PCR1 (bottom panel) confirmed that all clones retained the mtA
promoter.

Extended Data Figure 4 | Deep sequencing of scnRNAs from an early 
conjugation time point (early meiosis). A total of 39,041,474 small-RNA 
sequences of 25 nucleotides in length were obtained by Illumina sequencing of 
four libraries previously constructed from gel-purified molecules migrating at 
23, 24, 25, or 26 nucleotides (from the 7hND7 total RNA sample’). 
a, 25-nucleotide reads were first mapped (no mismatch allowed) on possible 
contaminant sequences (genomes of bacteria commonly found in cultures, 
P. tetraurelia mitochondrial genome, P. tetraurelia rDNA and other non-coding 
RNAs). The remainder was then mapped on known MIC sequences (the 
‘MAC+IES’ genome, and 9 individual copies of the Sardine transposon’). To 
determine whether any of the remaining reads could correspond to IES excision 
junctions or to spliced transcripts, they were then mapped on the MAC genome 
and on the genome-wide set of spliced transcripts. Very few hits were found and 
these did not show the characteristic 5’-UNG signature of scnRNAs (see 
d), suggesting that these molecules represent longer forms of endogenous 
siRNAs and/or could be mapped because of IES or intron annotation errors. Of 
the remaining unmapped reads (~49%), close to one-half could be mapped on 
‘PGM contigs’, a ~25-Mb preliminary assembly of MIC-specific sequences that 
are not collinear to MAC chromosomes’ and are thus likely to represent bona 
fide scnRNAs. b, Statistics about the coverage of the ‘MAC+IES’ genome 
(17,786,284 reads). The average coverage is similar for exons, introns and 
intergenic regions, but is ~2-fold higher for IES sequences. This may mean 
either that scnRNAs are initially produced in higher amounts from IESs, or that 
active degradation of scnRNAs homologous to MAC sequences is already 
under way at this early stage. c, Mapping of 25-nucleotide reads on the MIC 
version of the mtA gene region. Coding sequences of the corrected mtA gene 
model (PTETG5300016001, after correction of assembly indels and 
re-annotation) and of the short gene downstream (PTETG5300018001) are 
shown as red boxes interrupted by introns and IESs. The yellow box represents 
the 195-bp segment of the mtA promoter that is excised in O cells; the 4 IESs are 
shown as green boxes. 25-nucleotide reads mapping on the top or bottom 
strands of the region are colour-coded to indicate the number of times each one 
was sequenced: blue, one read; green, 2-9 reads; red, ≥10 reads. 
d, Compositional profiles (nucleotide frequency on the left, and deviation from 
randomness on the right) of reads mapping to the MAC+IES genome, to the 
MAC genome, or to ‘PGM contigs’. For each set, logos are shown for all reads 
(left), or for the non-redundant subset only (right). For the ‘MAC+IES’ 
and ‘PGM contigs’ sets, the logos computed from all reads clearly show the 
5'-UNG signature typical of scnRNA guide strands, as is the case for the major 
subset of reads starting with U (5’U), while the minor subset of sequences 
not starting with U (5’A/C/G), which may represent the steady-state amount of 
passenger strands, shows the complementary signature CNA at positions 
21-23. Deviation from randomness is greater when computed from all reads 
than when computed from the non-redundant subset only, indicating that 
molecules with the signature are intrinsically produced in higher amounts, or 
are more stable. The small set of reads that mapped only to the MAC genome 
(putative IES excision junctions) does not show a clear 5’-UNG signature, 
suggesting that most of those are not scnRNAs. The same is true of the small 
number of reads mapping to exon-exon junctions (not shown). 
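
The sequence signatures discussed in panel d, and the colour bins of panel c, can be made concrete with a short sketch. It is illustrative only and rests on stated assumptions: reads are written in the RNA alphabet (A, C, G, U), the duplex has 2-nt 3' overhangs on each strand (a geometry assumed here because it places the complement of 5'-UNG at positions 21-23 of a 25-nt passenger strand), and all function names and the example sequence are hypothetical.

```python
from collections import Counter

COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def has_guide_signature(read):
    """5'-UNG: U at position 1 and G at position 3 (1-based)."""
    return len(read) >= 3 and read[0] == "U" and read[2] == "G"


def has_passenger_signature(read):
    """CNA at positions 21-23 (1-based)."""
    return len(read) >= 23 and read[20] == "C" and read[22] == "A"


def passenger_from_guide(guide, overhang=2):
    """Passenger strand of a duplex with `overhang`-nt 3' overhangs.

    Assumed geometry: the guide's 3' overhang is unpaired, and the
    passenger's own overhang is unknown, so it is written as N.
    """
    paired = guide[:len(guide) - overhang]
    revcomp = "".join(COMPLEMENT[base] for base in reversed(paired))
    return revcomp + "N" * overhang


def position_frequencies(reads):
    """Per-position nucleotide frequencies for a list of equal-length reads."""
    table = []
    for pos in range(len(reads[0])):
        counts = Counter(read[pos] for read in reads)
        table.append({base: counts[base] / len(reads) for base in "ACGU"})
    return table


def coverage_colour(times_sequenced):
    """Colour code used in panel c: blue, 1 read; green, 2-9; red, 10 or more."""
    if times_sequenced >= 10:
        return "red"
    if times_sequenced >= 2:
        return "green"
    return "blue"


# Made-up 25-nt guide strand carrying the 5'-UNG signature.
guide = "UAG" + "ACGUACGUACGUACGUACGU" + "AC"
passenger = passenger_from_guide(guide)
frequencies = position_frequencies([guide, guide])
```

Under the assumed geometry, any guide strand with 5'-UNG yields a passenger strand with C at position 21 and A at position 23, which matches the complementary CNA signature described for the reads that do not start with U.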

Extended Data Figure 5 | Northern blot analysis of mtA-promoter 
scnRNAs during autogamy of O or E cells. Mass cultures were allowed to 
starve, and RNA samples were extracted during exponential growth (Exp) and 
then at different times (0-20 h) after the appearance of the first meiotic cells. 
Cells become committed for autogamy at a fixed point of the cell cycle, so that 
the best synchrony that can theoretically be achieved is the duration of one 
cell cycle; in these experiments, the time between the first and the last cells to 
begin meiosis was ~12 h. a, Proportions of cells in different cytological stages at 
each time point, as determined by DAPI staining. Veg, vegetative cells; Mei, 
meiosis (crescent stage, meiosis I, meiosis II); Frg, cells with fragmented old 
MAC but new MACs not yet clearly visible; Dev, cells with two clearly visible 
developing new MACs; Kar, cells after the karyonidal division, with only one 
developing new MAC. b, Northern blot analysis of mRNAs for early (PTIWI09, 
NOWA2), middle (PGM), or late (PTIWI10) genes. The same blots were 
hybridized successively with the 4 probes. c, Northern blot analysis of small 
RNAs. The top panels show hybridization with the 195-bp mtA-promoter 
probe, revealing accumulation of 25-nucleotide scnRNAs slightly later than 
expression of the meiosis-specific genes PTIWI09 and NOWA2. As a control, 
the same blots were rehybridized with an oligonucleotide probe specific for a 
cluster of 23-nucleotide endogenous siRNAs on scaffold 22'**', which are 
abundantly produced at all stages of the life cycle (bottom panels). 
Quantification of the mtA-promoter scnRNA signal and normalization with 
the siRNA signal did not reveal any significant difference in their amount or 
timing between the two mating types (not shown). Previous studies showed 
that the double-strand breaks that initiate IES excision in the new MACs start 
being detectable before the maximum of expression of the putative 
endonuclease PGM*® (no later than 10 h in these time courses). A PCR analysis 
of post-autogamous DNA samples confirmed that the mtA promoter was fully 
excised in mating type O, and fully maintained in mating type E (not shown). 

Extended Data Figure 6 | Phylogenetic tree of different strains from most
P. aurelia species based on sequence polymorphisms in three nuclear genes.
This figure is modified with permission from figure 2 of Catania et al.
Genetic diversity in the Paramecium aurelia species complex. Mol. Biol. Evol.
2009, 26, 421-431 (ref. 32). Wrong species assignments of some strains studied
here are corrected in red. Strain V5-13 was probably mis-assigned to
P. septaurelia through the same error as for strain GFg-1 (see Extended Data
Fig. 7). Species showing maternal inheritance or random determination of
mating types are highlighted in red and blue, respectively.

Extended Data Figure 7 | Strain GFg-1 belongs to the same species as strain
138, that is, P. octaurelia. GFg-1 is among a set of strains that were originally
assigned to P. septaurelia on the basis of conjugation tests with strain 38, a
reference strain for that species. However, the stock of strain 38 used in these
tests was not 38, but instead some P. octaurelia strain, as shown by the
comparison of mtA sequences with those from the original strain 38 obtained
from ATCC (ATCC number 30575) and those from strain 138, a reference
strain for P. octaurelia. a, Scheme of the cross GFg-1 (O) × 138 (E). The mating
types of parents were determined by cross-agglutination with P. tetraurelia
tester lines. The green and blue arrows beside each cell represent the mtA gene,
colour-coded to indicate the GFg-1 and 138 alleles; the box at the 5’ end
symbolizes retention of the mtA promoter in the MAC genome on the E side of
the cross. b, Molecular characterization of two pairs of F1 heterozygotes, and of
some F2 clones obtained by autogamy of these F1 heterozygotes. The parental
origin of each F1 clone was ascertained by a sequence polymorphism in the
mitochondrial COXII gene (PCR9, Supplementary Table 6). PCR amplification
and sequencing of a segment of the mtA gene encompassing the promoter
(PCR10, Supplementary Table 6) showed that the two F1 clones deriving from
the GFg-1 parent were heterozygotes, and that the mtA promoter was precisely
excised from both alleles (see Supplementary Data 1a). ND, not determined.
Analysis of 6 viable F2 clones obtained by autogamy of the F1 clones deriving
from the 138 parent showed that 4 of them were homozygous for the GFg-1
allele, whereas the other two were homozygous for the 138 allele; the mtA
promoter was retained in all cases. The evidence for successful genetic exchange
between strains GFg-1 and 138 and for viable recombinant F2 progeny
demonstrates that these strains belong to the same species; that is, P. octaurelia.

Clone   Mitochondria   mtA             Promoter status   Mating type
F1 pairs
1a      138            ND              ND                E
1b      GFg-1          138/GFg-1       excised           O
2a      138            ND              ND                E
2b      GFg-1          138/GFg-1       excised           O
F2 clones, E side
1a1     138            138/138         retained          E
1a2     138            GFg-1/GFg-1     retained          E
1a3     138            138/138         retained          E
2a1     138            GFg-1/GFg-1     retained          E
2a2     138            GFg-1/GFg-1     retained          E
2a3     138            GFg-1/GFg-1     retained          E
F2 clones, O side
1b1     GFg-1          GFg-1/GFg-1     excised           O
2b1     GFg-1          GFg-1/GFg-1     excised
2b2     GFg-1          GFg-1/GFg-1     excised           O

Extended Data Figure 8 | Genetic and molecular analysis of the cross 
between strains 227 and 38 of P. septaurelia. a, Maternal inheritance of 
mating types in the wild-type strain 227. b, Strain 38 is genetically restricted to 
O expression, but constitutively determined for E. The mutation identified in 
strain 38 is of particular interest because it affects both the expression and the 
inheritance of mating types, which suggests that it lies in a gene that controls 
mating-type determination through an alternative rearrangement. Indeed, 
known P. tetraurelia mutations fall in two distinct categories. mtA°, mtB° and 
mtC° prevent expression of type E but have no effect on the rearrangement that 
determines mating types or on its maternal inheritance, whereas m#F’ lies in a 
trans-acting factor required for a subset of rearrangements during MAC 
development but has no effect on the expression of mating types during sexual 
reactivity. The only type of mutation that can be envisioned to affect both 
expression and determination/inheritance would be a mutation preventing 
expression of a functional protein required for E expression, and at the same 
time preventing in cis (by destroying a potential Pgm cleavage site) the 
rearrangement that normally inactivates this gene in the MAC of wild-type 
O cells. The mt*” allele of strain 38 restricts cells to O expression in a 
recessive manner, but also has a maternal effect that enforces constitutive E 
determination in sexual progeny. Notably, elegant experiments showed the 
latter effect to be dominant”: the sexual progeny of a cell carrying at least one 
mt" allele can never be determined for O or transmit O determination. 

c, Sequencing of the mtB³⁸ allele revealed features that may account for both
effects. A frameshift mutation makes it a pseudogene, explaining the genetic
restriction to O expression. In addition, a 6-bp deletion removes one of the IES-like
boundaries used in the mtB²²⁷ allele for coding-sequence deletions in O
clones. Given the requirements of IES excision in P. tetraurelia and in sibling 
species’, this can be predicted to make the same deletions impossible in the 
mtB* allele, which would explain the constitutive E determination effect. The 
full-length mtB*® pseudogene in the MAC of 38 cells would indeed be expected, 
after a cross to 227, to protect the highly similar zygotic mtB?” allele against 
coding-sequence deletions in the derived F,, through titration of homologous 
scnRNAs. d, Molecular analysis of the 38 X 227 cross. To verify this maternal 
effect, we crossed an E-expressing 227 clone (pmtB° 1 transformed clone T, 
same as in Fig. 5b; C, uninjected control) with strain 38, and F, heterozygotes 
were tested for mating types and for mtA expression by northern blotting. As 
expected, F, heterozygotes deriving from the 38 parent (1b and 2b, as 
determined by sequencing of a mitochondrial polymorphism) were E and 
expressed mt, indicating that the incoming mtB~~ allele had been rearranged 
into a functional, full-length form in the MAC. After autogamy of F, clone 2b, 
24 independent F, homozygotes were isolated and tested for mating types. 
Consistent with the Mendelian segregation of mtB alleles, 12 were O and 12 
were E (Supplementary Table 5); northern blot analysis of 3 clones of each type 
showed that only E clones expressed mtA. e, All F2 clones maintained the full- 
length mtB gene in the MAC. PCR11 (Supplementary Table 6) amplifies an 
888-bp fragment from the MAC version of mtB*®, and an 893-bp fragment 
from the full-length MAC version of mtB”"”, Cl and C2, control PCR11 on two 
O clones of strain 227, showing the 806-bp and 744-bp fragments resulting 
from the two alternative coding-sequence deletions. f, Mating types co- 
segregate with mtB alleles among F, homozygotes. Digestion of the PCR 
products with Alul distinguishes the 38 and 227 alleles. mtA and mtC alleles 
segregated independently (Supplementary Table 5). 
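
The statement that 12 O and 12 E clones among 24 F2 homozygotes are consistent with Mendelian segregation can be checked with a chi-square goodness-of-fit test against a 1:1 ratio. The sketch below is illustrative only: the legend does not say which test, if any, the authors applied, and the function name is hypothetical.

```python
def chi_square_1to1(count_a, count_b):
    """Chi-square statistic for two classes under an expected 1:1 ratio (1 d.f.)."""
    expected = (count_a + count_b) / 2
    return ((count_a - expected) ** 2 + (count_b - expected) ** 2) / expected


# Critical value of the chi-square distribution for 1 degree of freedom, alpha = 0.05.
CRITICAL_DF1_ALPHA_05 = 3.841

# Counts reported in the legend: 12 O and 12 E among 24 F2 homozygotes.
observed_o, observed_e = 12, 12
statistic = chi_square_1to1(observed_o, observed_e)

# A statistic below the critical value means the 1:1 hypothesis is not rejected.
consistent_with_1to1 = statistic < CRITICAL_DF1_ALPHA_05
print(f"chi-square = {statistic:.3f}; consistent with 1:1: {consistent_with_1to1}")
```

With only 24 clones the test has limited power, so it can show compatibility with 1:1 segregation but cannot exclude modest deviations.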
Extended Data Figure 9 | Transformation of P. septaurelia 227 O cells with 
the full-length mtB²²⁷ gene induces heritable E expression. a, Plasmid 
pmtB²²⁷, containing the full-length E MAC form of mtB²²⁷, was microinjected 
into the MACs of O cells of strain 227, and 12 injected clones were tested by 
PCR11. b, This PCR amplifies an 893-bp product from the transgene, and 
products of 744 and 806 bp from the two internally deleted MAC forms of the 
endogenous mtB²²⁷. C1 and C2, control uninjected clones. Clones 4, 8, 9 and 11 
contained high copy numbers of the transgene, so that the PCR products from 
the endogenous gene are not detectable. Because only O tester lines were 
available, the mating types of injected clones could be ascertained only for 
E-expressing clones, as indicated below each lane. This was the case for the 4 
high-copy clones; other clones were presumably O, although it was impossible 
to make sure that the cells were sexually reactive (indicated by ‘-’). Injected 
clones were then taken through autogamy, and the mass progenies were again 
tested for mating types. In this case, the mass progenies from clones 9 and 11, 
which proved to be pure E, were used as E testers to carry out a full test for some 
of the other progenies. Of the other two transformed clones expressing E, clone 
4 gave rise to a selfing progeny (S), probably a mix of O and E clones, whereas 
clone 8 gave rise to a pure O progeny. 
Extended Data Figure 10 | A general model for mating-type determination 
in P. aurelia species. In the three species examined, mating type E depends on 
expression of the mtA protein during sexual reactivity. mtA transcription in 
turn requires the mtB and mtC gene products (the requirement for mtC in 
P. septaurelia, and for both genes in P. octaurelia, remains to be verified). In 
P. tetraurelia and P. octaurelia, mating type O is determined during MAC 
development by excision of the mtA promoter as an IES, preventing expression 
of the gene. In P. septaurelia, mating type O is determined by the excision of 
segments of the mtB coding sequence as IESs, which similarly prevents mtA 
expression. 
Amygdala interneuron subtypes control 
fear learning through disinhibition 


Steffen B. E. Wolff, Jan Gründemann, Philip Tovote, Sabine Krabbe, Gilad A. Jacobson, Christian Müller, Cyril Herry, 
Ingrid Ehrlich, Rainer W. Friedrich, Johannes J. Letzkus & Andreas Lüthi 


Learning is mediated by experience-dependent plasticity in neuronal circuits. Activity in neuronal circuits is tightly 
regulated by different subtypes of inhibitory interneurons, yet their role in learning is poorly understood. Using a 
combination of in vivo single-unit recordings and optogenetic manipulations, we show that in the mouse basolateral 
amygdala, interneurons expressing parvalbumin (PV) and somatostatin (SOM) bidirectionally control the acquisition of 
fear conditioning, a simple form of associative learning, through two distinct disinhibitory mechanisms. During an 
auditory cue, PV⁺ interneurons are excited and indirectly disinhibit the dendrites of basolateral amygdala principal 
neurons via SOM⁺ interneurons, thereby enhancing auditory responses and promoting cue-shock associations. During 
an aversive footshock, however, both PV⁺ and SOM⁺ interneurons are inhibited, which boosts postsynaptic footshock 
responses and gates learning. These results demonstrate that associative learning is dynamically regulated by the 
stimulus-specific activation of distinct disinhibitory microcircuits through precise interactions between different 
subtypes of local interneurons. 


Fear conditioning is a powerful model system for investigating plasticity 
of neuronal circuits and the mechanisms of associative learning, 
as the basic underlying circuitry has been thoroughly studied over the 
last decades. The basolateral amygdala (BLA) has been identified as 
a key brain area involved in auditory fear conditioning, where associative 
synaptic plasticity at glutamatergic sensory afferents is induced by 
the pairing of a neutral tone (conditioned stimulus (CS)) and a mild 
aversive footshock (unconditioned stimulus (US)) during conditioning. 
As a cortex-like structure, the BLA contains a variety of interneurons 
with marker expression, basic properties and connectivity very similar 
to those of neocortical circuits, and inhibition has been shown to have a 
role in fear conditioning. Functionally, salient sensory stimuli lead to 
cell-type-specific responses in different interneuron subtypes in 
anaesthetized animals. Moreover, fear conditioning and extinction have 
been shown to induce opposing changes in the strength of inhibitory 
transmission and in the expression of inhibition-related genes. 
However, besides the general notion that inhibition is involved in fear 
learning, knowledge about the role of individual interneuron subtypes 
is still lacking. 

One of the major interneuron subclasses in cortical and cortex-like 
structures such as the BLA is defined by the expression of the 
calcium-binding protein parvalbumin (PV; also called PVALB). Most 
PV⁺ interneurons feature a fast-spiking phenotype with narrow spike 
widths and high firing rates. PV⁺ interneurons preferentially form 
synapses at the perisomatic region of their target cells, thereby controlling 
neuronal activity and spike output. In addition to glutamatergic 
principal neurons, PV⁺ cells contact both other PV⁺ interneurons 
and different interneuron types. In the hippocampus, PV⁺ cells have 
been shown to interact with interneurons expressing somatostatin (SOM; 
also called SST) to regulate pyramidal cell output. In contrast to PV⁺ 
cells, SOM⁺ interneurons preferentially contact the distal dendrites, 
which allows them to efficiently control the impact of inputs to their 
target cells. 


Here, we address the role of interneuron subtypes using a combination 
of optogenetic manipulations and single-unit recordings from identified 
BLA interneurons during fear learning. Our results reveal that 
BLA PV⁺ and SOM⁺ interneurons exert bidirectional control over fear 
acquisition, resulting in differential changes in inhibition along the 
somatodendritic axis of principal neurons. We identify inhibition of SOM⁺ 
interneurons by PV⁺ interneurons as a crucial mechanism for dendritic 
disinhibition of principal neurons. Our findings indicate that exquisitely 
organized inhibitory and disinhibitory microcircuits comprising 
defined subtypes of interneurons have important and distinct roles during 
associative learning. 


PV⁺ interneurons in the BLA control fear learning 


To examine a possible role of PV⁺ cells in fear learning, we selectively 
expressed channelrhodopsin-2 (ChR2 (ref. 30)) in PV⁺ interneurons 
in the BLA (Fig. 1a-c; see Methods), and used bilaterally implanted 
optic fibres for light stimulation (Fig. 1a and Extended Data Fig. 1a). 
Importantly, stimulation of PV⁺ interneurons alone did not have any 
effect on freezing or locomotion (Extended Data Fig. 2a). Mice were 
conditioned to two distinct CSs (tones of different frequencies, 
counterbalanced between experiments). Whereas PV⁺ cell activity was 
manipulated during one entire CS-US pairing, the other CS-US pair served 
as a within-animal control (Fig. 1d, left). A fear memory retrieval test 
on the next day without optogenetic stimulation revealed that PV⁺ 
interneuron activation during acquisition significantly attenuated the 
CS-induced freezing response when compared with the internal control 
CS (Fig. 1d, right and Extended Data Fig. 2b). These results indicate 
that the level of PV⁺ interneuron activity in the BLA can control 
the acquisition of conditioned fear responses. 


US-induced inhibition of PV⁺ interneurons gates learning 


Next, we asked whether optogenetic activation of PV⁺ interneurons 
specifically during the US would be sufficient to interfere with fear 
Figure 1 | BLA PV⁺ interneurons control fear learning. a, Optogenetic 
manipulation and simultaneous single-unit recordings using an optrode in 
freely behaving mice. b, Co-expression of ChR2 and Venus (green) in PV⁺ 
interneurons in the BLA, using PV-Cre mice and conditional rAAVs. 
Counterstains: DAPI (blue) and NeuN (red). CeL, central lateral amygdala. 
c, Restriction of ChR2 and Venus expression (green) to PV⁺ interneurons is 
verified by PV immunohistochemistry (IHC) (red). d, Left, fear conditioning 
paradigm to test for effects of PV⁺ manipulation on fear learning. Animals 
were conditioned to two distinct auditory CSs. CS2 was paired with blue light 
for ChR2 activation during the entire CS-US pairing. During retrieval on the 
next day, both CSs were presented without light. CS order and light-pairing 
were counterbalanced. Right, animals displayed significantly less freezing to the 
light-paired CS2 during retrieval compared to CS1, indicating that PV⁺ 
activation during the entire CS-US pairing impairs fear acquisition. Light 
stimulation had no effect on control animals expressing only GFP. Values are 
mean ± s.e.m. *P < 0.05, ***P < 0.001. Statistical analysis in Methods. 


conditioning (Fig. 2a and Extended Data Fig. 2b). When we tested fear 
memory retrieval the next day without optogenetic stimulation, we observed 
that learning was impaired to a similar extent as observed with PV⁺ 
interneuron activation during the entire CS-US pairing (Fig. 1d and 
Extended Data Fig. 2b). This demonstrates that PV⁺ interneuron activity 
during the US is a crucial factor regulating fear learning. To address 
whether endogenous activity of PV⁺ cells is necessary for fear learning, 
we used conditional expression of the inhibitory opsin ARCH (Fig. 2a). 
In contrast to ChR2 manipulations, ARCH-mediated inhibition of PV⁺ 
interneurons during the US resulted in increased freezing levels during 
fear memory retrieval (Fig. 2a and Extended Data Fig. 2b). Together, 
these results demonstrate that fear learning is under bidirectional 
control of BLA PV⁺ interneurons during the US. 
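
The within-animal design described above compares freezing to a light-paired CS with freezing to a control CS in the same mouse. A minimal sketch of how such paired differences could be summarized as mean ± s.e.m. is given here; the freezing values are invented for illustration, and the variable and function names are hypothetical rather than taken from the study.

```python
import math
import statistics


def mean_and_sem(values):
    """Return the mean and the standard error of the mean of a sample."""
    mean = statistics.mean(values)
    sem = statistics.stdev(values) / math.sqrt(len(values))
    return mean, sem


# Hypothetical freezing (% of CS duration) for five mice; not data from the paper.
freezing_control_cs = [60.0, 55.0, 70.0, 65.0, 50.0]
freezing_light_cs = [40.0, 45.0, 50.0, 35.0, 30.0]

# Paired difference per animal: light-paired CS minus control CS.
differences = [light - control
               for light, control in zip(freezing_light_cs, freezing_control_cs)]

mean_diff, sem_diff = mean_and_sem(differences)
print(f"freezing difference = {mean_diff:.1f} ± {sem_diff:.1f} %")
```

A negative difference corresponds to impaired learning for the light-paired CS and a positive difference to enhanced learning, which is the sense in which the manipulations are called bidirectional.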

To relate the behavioural changes induced by these manipulations 
to the physiological role of PV⁺ interneurons, we used custom-built 
optrodes (Fig. 1a and Extended Data Fig. 1; see Methods) to perform 
chronic single-unit recordings from optogenetically identified PV⁺ cells. 
Units exhibiting light-induced short-latency activity changes were identified 
as PV⁺ interneurons (Fig. 2b, c and Extended Data Fig. 3a-e; see 
Methods for identification criteria). In agreement with previous recordings 
in slices, optogenetically identified PV⁺ interneurons showed 
faster spike kinetics and higher spontaneous activity than simultaneously 
recorded non-light-responsive units (Fig. 2d and Extended Data Fig. 4). 
Recordings from identified light-responsive units (n = 31) revealed that 
firing of the PV⁺ interneuron population was reduced for the entire 
duration of the footshock (Fig. 2e), although a minority displayed excitation 
(7 out of 31, Extended Data Fig. 5a). These data are consistent 
with the observed behavioural effects: optogenetic activation of PV⁺ 
interneurons counteracted the endogenous US-induced inhibition and 
decreased learning, whereas optogenetic inhibition enhanced the endogenous 
PV⁺ inhibition, resulting in enhanced fear learning (Fig. 2a and 
Extended Data Fig. 2b). 
Figure 2 | Footshock-induced PV⁺ cell inhibition gates fear learning. 
a, PV⁺ activity during the US regulates fear learning. Left, conditioning as in 
Fig. 1d, but illumination with blue or yellow light was restricted to the US. Right, 
PV⁺ activation and inhibition during the US had opposite effects on fear 
learning. Shown are differences in freezing between the two CSs. b, Optogenetic 
identification of PV⁺ interneurons in the BLA of freely behaving mice, based on 
light-induced changes in single-unit activity. c, Light-responsive units were 
defined as those with significant, short-latency light-induced changes in 
activity. d, Optogenetically identified PV⁺ units differ in spike width and 
spontaneous firing rate from simultaneously recorded non-light-responsive 
units, mainly representing principal neurons. e, Footshocks inhibit PV⁺ cell 
activity during conditioning (top, example response; bottom, Z-scored 
population response). f, Z-scored population responses for footshock-excited 
putative principal neurons (PNs) with (blue or yellow) and without (black) 
optogenetic stimulation of PV⁺ interneurons (for selection criteria see 
Methods). g, Left, Z-score differences in the footshock response induced by 
optogenetic manipulation of PV⁺ interneurons in footshock-excited putative 
principal neurons. Right, median change index for the footshock responses of 
putative principal neurons upon PV⁺ cell manipulations (see Methods). 
h, Fractions of excited putative principal neurons, which are excited either 
exclusively during the footshock (black), exclusively during the footshock 
+ light (blue or yellow) or in both conditions (grey). Values are mean ± s.e.m. 
unless stated otherwise. *P < 0.05, **P < 0.01, ***P < 0.001. Statistical 
analysis in Methods. 
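
The Z-scored population responses referred to in this legend express firing in each time bin relative to the mean and standard deviation of a pre-stimulus baseline. The sketch below shows one common way to do this; the bin layout, the baseline length and the firing rates are assumptions made for illustration, and the exact procedure used in the study is the one given in its Methods.

```python
import statistics


def z_score_psth(binned_rates, n_baseline_bins):
    """Z-score a binned firing-rate trace against its first n_baseline_bins bins."""
    baseline = binned_rates[:n_baseline_bins]
    baseline_mean = statistics.mean(baseline)
    baseline_sd = statistics.stdev(baseline)
    if baseline_sd == 0:
        raise ValueError("baseline has zero variance; Z-score is undefined")
    return [(rate - baseline_mean) / baseline_sd for rate in binned_rates]


# Hypothetical firing rates (Hz): four baseline bins, then bins during a footshock.
rates_hz = [10.0, 12.0, 8.0, 10.0, 2.0, 1.0, 3.0, 9.0]
z_scores = z_score_psth(rates_hz, n_baseline_bins=4)

# Strongly negative values during the stimulus indicate inhibition relative to baseline.
print([round(z, 2) for z in z_scores])
```

Averaging such traces across units gives a population response in which cells with very different baseline rates contribute on a common scale.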
Because PV⁺ interneurons mediate strong perisomatic inhibition, 
we speculated that US-induced inhibition of PV⁺ cells causes disinhibition 
of principal neurons. We therefore analysed the impact of PV⁺ 
interneuron activity on excitatory footshock responses in putative principal 
neurons (non-light-responsive units with spontaneous firing rates 
<3 Hz; n = 112). We observed a significant decrease in excitatory 
footshock responses upon optogenetic activation of PV⁺ interneurons 
(Fig. 2f, g and Extended Data Fig. 6). In contrast, inhibition of 
PV⁺ interneurons significantly increased the principal neuron responses 
(Fig. 2f, g and Extended Data Fig. 6). In addition to affecting the strength 
of the footshock-induced responses, optogenetic manipulations of PV⁺ 
interneuron activity differentially changed the number of footshock-excited 
cells. Whereas PV⁺ activation led to a loss of excitation in almost 
half of the footshock-excited cells, PV⁺ inhibition uncovered excitatory 
responses in a large fraction of putative principal neurons (Fig. 2h 
and Extended Data Fig. 6). To address directly whether changes in the 
footshock-induced activity of principal neurons have an effect on learning, 
we expressed ARCH under the control of the CaMKII promoter 
and optogenetically inhibited principal neurons during the US. This 
manipulation significantly impaired fear learning, demonstrating the 
necessity of footshock-induced principal neuron activation (Extended 
Data Fig. 6e). Together, these results indicate that footshock-induced 
inhibition of PV⁺ cells regulates activity in the BLA network in a manner 
consistent with our behavioural observations: PV⁺ interneuron inhibition 
causes disinhibition and enhances US-induced activity in principal 
neurons, probably acting as a permissive gate for plasticity induction 
and fear learning. 
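
Two bookkeeping steps in the analysis above lend themselves to a short illustration: selecting putative principal neurons (non-light-responsive units with spontaneous firing rates below 3 Hz) and sorting excited cells into those excited only without light, only with light, or in both conditions. The following is a sketch under assumed data structures; the `Unit` record, the function names and the example units are hypothetical, and only the selection criterion comes from the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Unit:
    unit_id: str
    light_responsive: bool
    spontaneous_rate_hz: float


def putative_principal_neurons(units, max_rate_hz=3.0):
    """Keep non-light-responsive units firing below max_rate_hz at rest."""
    return [u for u in units
            if not u.light_responsive and u.spontaneous_rate_hz < max_rate_hz]


def partition_excited(excited_without_light, excited_with_light):
    """Split excited unit IDs into footshock-only, footshock+light-only and both."""
    without_light = set(excited_without_light)
    with_light = set(excited_with_light)
    return {
        "footshock_only": without_light - with_light,
        "footshock_plus_light_only": with_light - without_light,
        "both": without_light & with_light,
    }


# Hypothetical recorded units; u1 stands in for an identified interneuron.
units = [
    Unit("u1", light_responsive=True, spontaneous_rate_hz=18.0),
    Unit("u2", light_responsive=False, spontaneous_rate_hz=1.2),
    Unit("u3", light_responsive=False, spontaneous_rate_hz=0.4),
    Unit("u4", light_responsive=False, spontaneous_rate_hz=7.5),
]
principal = putative_principal_neurons(units)
groups = partition_excited({"u2", "u3"}, {"u3"})
print([u.unit_id for u in principal], groups)
```

In this toy case, unit u2 loses its footshock excitation when light is applied, which is the kind of change summarized as a fraction of cells in Fig. 2h.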


Differential roles for PV⁺ and SOM⁺ interneurons during CS 


We next addressed the question of whether inhibition of PV⁺ interneurons 
and the consequent disinhibition of principal neurons also occur 
during the auditory CS, which precedes the footshock US. Unexpectedly, 
when optogenetically manipulating the activity of PV⁺ interneurons 
exclusively during the CS, we found the opposite effects on fear 
learning compared to US-specific manipulations (Fig. 3a and Extended 
Data Fig. 2b). This suggests fundamentally different roles of PV⁺ 
interneurons in processing of CS and US. 

In line with this, the PV⁺ interneuron population exhibited CS-induced 
excitation (Fig. 3b), although a subset displayed inhibitory responses 
(14 out of 36, Extended Data Fig. 5c). If PV⁺ interneurons influenced 
principal neurons mostly via monosynaptic perisomatic inhibition, we 
would expect optogenetic manipulations of PV⁺ interneurons to have 
an effect on learning opposite to the one that we observed. 

A possible explanation for the paradoxical effect of PV⁺ interneurons 
on fear conditioning could be that PV⁺ interneurons interact not 
only with principal neurons, but in addition with other interneuron 
subtypes to cause disynaptic disinhibition of principal neurons. Because 
sensory afferents from auditory thalamus and cortex make synapses 
onto the dendrites of BLA principal neurons, we proposed that 
dendrite-targeting SOM⁺ interneurons might mediate such an effect. To address 
this, we performed optogenetic manipulations and single-unit recordings 
from identified SOM⁺ interneurons using the same strategy as for 
PV⁺ interneurons (Extended Data Figs 1, 3, 4). In agreement with the 
hypothesis that SOM⁺ interneurons could be inhibited by PV⁺ cells 
during the CS, we found that the behavioural consequences of optogenetic 
manipulations of SOM⁺ cells during the CS were opposite to 
those induced by PV⁺ manipulations (Fig. 3c and Extended Data Fig. 2b), 
and that CS presentation induced inhibition of identified SOM⁺ 
interneurons (Fig. 3d and Extended Data Fig. 5d). Taken together, this 
indicates that the CS causes subcellular reallocation of inhibition in principal 
neurons, with disinhibition of sensory dendritic inputs, which overrides 
a concomitant increase in perisomatic inhibition (see below). 


Dendritic disinhibition by SOM⁺ interneurons 


To address whether PV⁺ interneurons indeed make synaptic contacts 
onto SOM⁺ interneurons, as has been demonstrated in the hippocampus 
Figure 3 | Differential roles of PV⁺ and SOM⁺ interneurons during the CS. 
a, Top, conditioning as in Fig. 2a, but with optogenetic manipulation restricted 
to the CS. Bottom, PV⁺ activity manipulations during the CS bidirectionally 
affect learning, with an opposite sign compared to manipulations during the 
US (Fig. 2a). b, Optogenetically identified PV⁺ cells are excited by the CS 
during conditioning (top, example trace, dashed line: average baseline firing; 
bottom, population response). c, Top, animals expressing opsins in BLA SOM⁺ 
interneurons were conditioned as in a. Bottom, SOM⁺ activity manipulations 
affected learning bidirectionally, in the opposite way compared to PV⁺ 
manipulations. d, SOM⁺ cells are inhibited by the CS during conditioning. 
Values are mean ± s.e.m. *P < 0.05, **P < 0.01, ***P < 0.001. Statistical 
analysis in Methods. 


(Fig. 4a, top left), we combined whole-cell recordings from interneurons 
(n = 101) in acute brain slices with optogenetic activation of PV⁺ 
interneurons (Fig. 4a and Extended Data Fig. 7). Post-hoc 
immunocytochemical identification revealed that 4 out of 11 identified SOM⁺ 
interneurons received reliable inhibitory input from PV⁺ cells (Fig. 4a, 
top right and bottom; Extended Data Fig. 7), which can be sufficient to 
suppress spiking in SOM⁺ interneurons (Extended Data Fig. 7). Owing 
to potential wash-out of the antigen SOM in whole-cell recordings, these 
data represent a solid lower bound rather than the actual connectivity 
rate (see Methods). Nevertheless, these results clearly demonstrate 
the existence of a functional connection between PV⁺ and SOM⁺ 
interneurons in the BLA. 
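
With 4 connected cells out of 11 identified SOM⁺ interneurons, the sampling uncertainty on the observed connectivity rate is large. One way to make this explicit is a Wilson score interval for a binomial proportion, sketched below. This calculation is not part of the study and is offered only as an illustration; it addresses sampling error alone and does not correct for the antigen wash-out that makes the observed rate a lower bound.

```python
import math


def wilson_interval(successes, n, z=1.96):
    """Wilson score interval for a binomial proportion (z = 1.96 for about 95%)."""
    if n <= 0:
        raise ValueError("n must be positive")
    p = successes / n
    denom = 1 + z ** 2 / n
    centre = (p + z ** 2 / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z ** 2 / (4 * n ** 2)) / denom
    return centre - half_width, centre + half_width


# Counts reported in the text: 4 of 11 identified SOM+ cells received PV+ input.
connected, tested = 4, 11
low, high = wilson_interval(connected, tested)
print(f"observed rate = {connected / tested:.2f}, interval = ({low:.2f}, {high:.2f})")
```

The Wilson interval is used here instead of the simpler normal approximation because the latter behaves poorly for samples this small.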

A prerequisite for the ability of SOM⁺ interneurons to influence 
directly the integration of auditory inputs in principal neurons is that 
they preferentially target principal neuron dendrites. To address this, 
we conditionally expressed green fluorescent protein (GFP) fused to the 
presynaptic marker synaptophysin in BLA SOM⁺ interneurons. We 
observed prominent synaptic contacts on the dendrites of principal 
neurons (Fig. 4b). In contrast, PV⁺ interneurons showed a distinct 
targeting pattern, preferentially contacting the perisomatic domain of 
principal neurons (Extended Data Fig. 8). We next asked whether SOM⁺ 
interneurons can control the processing of sensory information in 
principal neurons. Electrical stimulation of thalamic sensory afferents 
in acute brain slices evoked reliable excitatory postsynaptic potentials 
Figure 4 | PV⁺ and SOM⁺ interneurons form a disinhibitory microcircuit 
in the BLA. a, Top left, proposed PV⁺ and SOM⁺ microcircuit and 
experimental set-up. Top right, whole-cell recordings of EGFP⁺ cells in 
GAD67-EGFP::PV-Cre mice, infected with rAAV-DIO-ChR2-mCherry. 
Inhibitory inputs in SOM⁺ cells evoked by optogenetic activation of PV⁺ 
interneurons are shown. Bottom, post-hoc IHC identification of SOM⁺ cells 
(EGFP⁺ cells n = 101; of those, SOM⁺ n = 11; SOM⁺ and input n = 4; see 
Extended Data Fig. 7c). b, Example of a principal neuron dendrite (white) and 
synaptic contacts (yellow arrowheads) from SOM⁺ interneurons (red; 
conditionally expressed presynaptic marker synaptophysin-GFP). c-g, SOM⁺ 
interneurons modulate excitatory inputs and neuronal output of BLA principal 
neurons in slices. c, Left, experimental design. Right, ChR2-mCherry⁺ SOM⁺ 
interneurons (red) and recorded principal neuron (blue) in the BLA. 
d, Example traces of excitatory thalamic (left, electrical stimulation) and 
inhibitory SOM⁺-cell-evoked (middle, 5 ms blue light) postsynaptic potentials 
as well as co-activation (right, electrical stimulation 5 ms after light onset). 
e, Averages of recordings in d. f, Spike output of principal neurons in response 
to thalamic afferent stimulation with and without SOM⁺ cell activation. 
g, Normalized amplitude and decay time of thalamic EPSPs and spike 
probability in response to thalamic EPSPs with simultaneous activation of 
SOM⁺ interneurons. Values are mean ± s.e.m. *P < 0.05, ***P < 0.001. 
Statistical analysis in Methods. 


(EPSPs) in principal neurons (Fig. 4c-e), whereas optical stimulation 
of SOM⁺ interneurons evoked inhibitory postsynaptic potentials (IPSPs) 
(Fig. 4d, e). Co-activation caused a significant reduction of both the 
amplitude and the decay time of the evoked EPSPs (Fig. 4d, e, g), showing 
that SOM⁺ interneurons can control the integration of thalamic 
inputs in principal neurons (see Methods). In addition, spike output 
induced by supra-threshold thalamic stimulation was significantly reduced 
by simultaneous activation of SOM⁺ interneurons (Fig. 4f, g). These 
results demonstrate that PV⁺ interneurons are capable of inhibiting 
the output of SOM⁺ interneurons, which in turn can modulate integration 
of excitatory inputs in BLA principal neurons and thereby influence 
principal neuron spike output. 

The emerging picture is that CS processing in the BLA is under the 
control of a microcircuit in which PV-to-SOM inhibition disinhibits 
principal neuron dendrites during tone presentation, while the principal 
neuron perisomatic domain receives increased inhibition directly 
from PV⁺ interneurons. In contrast, during the US, both PV⁺ and SOM⁺ 
interneurons are inhibited (Fig. 2e and Extended Data Fig. 5a, b), leading 
to a qualitatively different pattern of inhibition along the 
somatodendritic axis of principal neurons (see Discussion). 


Impact on BLA network activity 


To evaluate how inhibition supplied by PV⁺ and SOM⁺ interneurons 
affects CS processing in the BLA, we compared tone-evoked excitatory 
responses in putative principal neurons with and without optogenetic 
manipulation of the interneurons. Activation of SOM⁺ cells resulted in 
a strong reduction of tone-evoked activity in principal neurons, whereas 
inhibition of SOM⁺ interneurons caused a slight enhancement (Fig. 5a, b 
and Extended Data Fig. 9), supporting the idea that SOM⁺ cells directly 
inhibit principal neuron tone input processing and/or responses. 
Activation of PV⁺ interneurons caused a substantial enhancement of tone 
responses, whereas inhibition of PV⁺ interneurons had little effect on 
tone responses (Fig. 5a, b and Extended Data Fig. 9), indicating that 
direct PV-principal neuron inhibition is outweighed by the indirect 
PV-SOM-principal neuron disinhibitory pathway during CS processing (see 
below). In addition to effects on the average CS response of putative 
principal neurons, PV⁺ and SOM⁺ interneurons also differentially affected 
the number of tone-excited principal neurons. Whereas conditions 
Figure 5 | Distinct effects of PV⁺ and SOM⁺ interneurons on BLA network 
activity. a, Tone-evoked population responses in tone-excited putative 
principal neurons with (blue or yellow) and without (black) optogenetic 
stimulation of PV⁺ or SOM⁺ interneurons (for selection criteria see Methods). 
Grey bars denote tone duration; blue and yellow bars indicate optogenetic 
stimulations. b, Left, Z-score differences in tone-excited putative principal 
neurons during the tone induced by optogenetic manipulations of PV⁺ or 
SOM⁺ interneurons. Right, median change index for the tone responses of 
putative principal neurons upon PV⁺ or SOM⁺ cell manipulations (see 
Methods). c, Fractions of the excited putative principal neurons, which are 
excited either exclusively during the tone (black), exclusively during the 
tone + light (blue or yellow) or in both conditions (grey). d, Bidirectional 
changes in spontaneous activity of putative principal neurons on optogenetic 
manipulations of PV⁺ and SOM⁺ interneurons. Activation of both 
interneuron types reduces (left) and inhibition enhances (right) BLA network 
activity. Values are mean ± s.e.m. unless stated otherwise. *P < 0.05, 
**P < 0.01, ***P < 0.001. Statistical analysis in Methods. 
impairing learning (SOM-ChR2 and PV-ARCH; Fig. 3 and Extended 
Data Fig. 2b) reduced the number of tone-excited cells, manipulations 
enhancing learning (PV-ChR2 and SOM-ARCH; Fig. 3 and Extended 
Data Fig. 2b) uncovered excitatory tone responses in large populations 
of putative principal neurons that were previously unresponsive (Fig. 5c 
and Extended Data Fig. 9). Together, these results demonstrate that PV⁺ 
and SOM⁺ interneurons have opposite effects on CS responses that are 
consistent with the observed impact on learning (Fig. 3 and Extended 
Data Fig. 2b): manipulations that increased fear learning caused stronger 
tone responses in more principal neurons, whereas conditions that 
impaired learning led to smaller responses in fewer principal neurons. 
In addition, these data indicate that the described disinhibitory 
microcircuit is an important factor governing tone processing in the BLA 
during learning. Interestingly, spontaneous activity of putative principal 
neurons was not differentially affected by manipulations of the 
two interneuron populations (Fig. 5d and Extended Data Fig. 10a, c), 
suggesting that these effects are probably stimulus-specific and reflect 
the organization of sensory inputs onto principal neurons: auditory 
inputs arriving at principal neuron dendrites are affected more strongly 
by dendritic inhibition from SOM⁺ interneurons, potentially by locally 
preventing the initiation of dendritic spikes. In contrast, spontaneous 
firing is affected by both perisomatic inhibition from PV⁺ interneurons 
and dendritic inhibition from SOM⁺ cells. 


Discussion 


The level of activation of a principal neuron in the BLA during fear 
learning is a key factor determining whether inputs to this principal 
neuron will undergo synaptic plasticity and whether cellular plasticity 
is triggered that enables recruitment of the neuron into the memory 
trace. Consistent with a central role of synaptic inhibition in this 
process, we observed strong effects on fear memory retrieval when we 
manipulated distinct interneuron types during fear memory acquisition. 
In addition, our data point to a multifaceted role of inhibition in 
acquisition of conditioned fear: we find that the effects of optogenetic 
manipulation of interneuron activity strongly depend on both stimulus 
identity (CS versus US) and interneuron type (PV⁺ versus SOM⁺). 
Moreover, our results identify interneuron-interneuron interactions as 
a novel and important factor governing auditory responses and learning. 

During the presentation of an auditory cue, PV⁺ interneurons are, 
on average, excited, most likely through direct sensory input from 
auditory thalamus and auditory cortex. As in other cortical structures, 
this will lead to somatic feedforward inhibition in postsynaptic principal 
neurons, a process that is expected to restrict their activation and, 
as a consequence, the induction of synaptic plasticity during learning. 
Unexpectedly, our results indicate that increasing PV⁺ cell activity during 
the CS correlates with enhanced learning, and with stronger auditory 
responses in principal neurons. These observations can be explained 
in light of our finding that PV⁺ interneurons not only contact principal 
neurons, but also supply inhibition to dendrite-targeting SOM⁺ 
interneurons in the BLA. In line with this, activation of SOM⁺ interneurons 
reduces learning and decreases auditory responses in BLA principal 
neurons. Together, these results strongly suggest that during auditory 
processing, dendritic disinhibition in principal neurons overrides the effect of 
increased perisomatic inhibition, potentially through increased dendritic 
electrogenesis and associated burst firing (Extended Data Fig. 10b, d), 
as has been demonstrated in the hippocampus. Furthermore, it is 
conceivable that locally restricted regulation of dendritic inhibition adds 
a level of control by determining which inputs contribute to action potential 
firing and undergo plasticity upon association with the US. 

In contrast, during the aversive US, PV⁺ cells were inhibited and 
optogenetic augmentation of US-induced inhibition enhanced fear 
learning. This indicates that inhibition of PV⁺ interneurons represents 
an important, permissive mechanism for fear learning, most likely 
by gating associative plasticity induction during the US, as has recently 
been shown in auditory cortex. Our data now extend and complement 
this view by demonstrating that SOM⁺ cells are also inhibited during 
the US, indicating disinhibition of the entire somatodendritic axis of 
principal neurons, thereby strongly increasing their activity, which is 
required for fear learning (Extended Data Fig. 6e). Moreover, because 
auditory inputs synapse onto dendrites, dendritic disinhibition represents 
an important, novel mechanism in auditory fear learning, favouring 
the induction of synaptic plasticity during the aversive US. A likely 
source of US-induced inhibition is other interneuron subtypes that can 
contact both PV⁺ and SOM⁺ interneurons, and/or fast neuromodulatory 
processes. An important open question pertains to the mechanisms 
bridging the temporal gap between CS-related firing and activity 
caused by the US. Potential candidates include dendritic plateau potentials 
and/or second messengers such as calcium, and future experiments 
will be required to test these possibilities. 

Although we observed that most PV+ and SOM+ interneurons exhibited
CS and US responses consistent with the effects of optogenetic manipulations
on learning, both interneuron types also displayed response
heterogeneity (Extended Data Fig. 5). This may at least partially reflect
anatomically defined subtypes of interneurons, for example, PV+ basket
or PV+ chandelier cells (see ref. 14). Alternatively, the functional heterogeneity
might also be due to differential input or output connections
of interneuron subgroups. Future advances in genetic targeting of spe- 
cific neuronal subpopulations may allow a more detailed dissection of 
the role of these functional subgroups of interneurons. 

Different types of interneurons exhibit distinct temporal dynamics
during different network states that correlate with specific aspects of
behaviour. It has been postulated that direct interactions between
soma-targeting and dendrite-targeting interneurons are an important
mechanism by which spatio-temporal patterns of inhibition might be
regulated. Here, we provide evidence that similar mechanisms can
govern associative learning and the strength of the memory trace. In
the identified microcircuit, differential modulation of PV+ and SOM+
interneurons may permit flexible regulation of learning according to
the behavioural context and the animal's internal state.


METHODS SUMMARY 


PV-Cre (ref. 31) or SOM-Cre (ref. 37) mice (2-4 months) were used. Full details of animals,
materials and methods for virus injections, optogenetic stimulations, behaviour, 
single-unit recordings and slice electrophysiology are provided in Methods. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS

Animals. Male PV-Cre (ref. 31) or SOM-Cre (ref. 37) mice (2-4 months) were 
individually housed in a 12 h light/dark cycle while the experiments were running. 
Before behavioural experiments, animals were habituated to the experimenter by
handling at least 3 times. Food and water were available ad libitum. Behavioural
experiments were performed during the light cycle. All animal procedures were 
performed in accordance with institutional guidelines and were approved by the 
Veterinary Department of the Canton of Basel-Stadt. 

Virus injection and optogenetics. Mice were anaesthetized with isoflurane (Attane,
Provet; induction 5%, maintenance 1.5%) in oxygen-enriched air (Oxymat 3, Weinmann)
and fixed in a stereotactic frame (Kopf Instruments). Local injections of
ropivacaine (Naropin, AstraZeneca) under the scalp and systemic injections of meloxicam
(60 µl of 0.5 mg ml⁻¹, intraperitoneal, Metacam, Boehringer Ingelheim)
were provided for analgesia. A feedback-controlled heating pad (FHC) assured
maintenance of the core body temperature at 36.5 °C.

For selective expression of opsins, conditional Cre-dependent recombinant adeno-associated
viruses (rAAV) were injected into the basolateral amygdala (BLA) of
PV-Cre or SOM-Cre mice, respectively. For expression of the excitatory
channelrhodopsin-2 (ref. 30) we used a custom rAAV 2/7 EF1a::DIO-ChR2(H134R)-2A-NpHR-2A-Venus
or rAAV 2/5 EF1a::DIO-ChR2(H134R)-mCherry, and
for expression of the inhibitory proton pump ARCH, we used an rAAV 2/5 CBA::flex-ARCH-GFP.
For specific expression of ARCH in principal neurons, we used
an rAAV 2/5 CaMKII::ARCH-GFP (all rAAVs from Vector Core, University of Pennsylvania).
rAAVs (~0.5 µl per hemisphere) were delivered into the BLA using glass
pipettes (tip diameter 10-20 µm) connected to a picospritzer (Parker Hannifin
Corporation) at the following coordinates: 1.7 mm posterior to bregma, ±3.3 mm
lateral to the midline and 4.1-4.2 mm below the cortical surface. For optogenetic
manipulations during behaviour without electrophysiological recordings, animals
were additionally implanted with bilateral custom-built optic fibre connectors (fibre:
0.48 numerical aperture, 200 µm diameter, Thorlabs). Fibre tips were lowered to
300 µm above the BLA at the injection site (Extended Data Fig. 1a). Implants were
fixed to the skull with skull screws, cyanoacrylate glue (Ultra Gel, Henkel) and den- 
tal cement (Paladur, Heraeus). After surgery, animals were allowed to recover for 
at least 4 weeks before behavioural training to ensure sufficient expression of the 
opsins. For optogenetic manipulations during behaviour, the implanted fibres were 
connected to a custom-built laser bench using an AOTF (AA Opto-Electronic) to 
control laser intensity (lasers: MBL473, 473 nm wavelength and MGL593.5, 593.5 nm 
wavelength, CNI Lasers). To ensure that animals could move freely, the connecting 
fibres were suspended over the behavioural context. 

Optrode recordings and optogenetic identification of interneurons. In most
animals one optical connector was replaced by a custom-built optrode to allow
simultaneous optogenetic stimulations and single-unit recordings during behaviour.
Optrodes consisted of an optic fibre connector with an electrode attached
directly to the fibre, with the tip protruding approximately 300-500 µm beyond the
fibre. Electrodes were made of 16 individually insulated, gold-plated nichrome
wires (13 µm inner diameter, impedance 30-100 kΩ, Sandvik), attached to a connector
(18 pin, Omnetics). The whole implant was fixed to the skull as described
above. Single-unit recordings were performed as described below.

For optogenetic identification of PV+ or SOM+ interneurons, we tested all recorded
units in all animals for light responsiveness. We presented pulses of either blue
or yellow light to activate ChR2 or ARCH, respectively (300 ms, 60 pulses, 2-s inter-pulse
interval, 15-20 mW at fibre tip). Units were considered as light responsive if
they showed significant, time-locked (<7 ms) changes in neuronal activity on illumination
(Fig. 2c and Extended Data Fig. 3i). The onset of optogenetic excitation
was determined as the average latency of the first light-evoked spike in each trial.
To determine the onset of inhibition reliably in neurons with low baseline firing
rates, we applied a change-point analysis (Change Point Analyzer 2.0, Taylor Enterprises
Inc.). Change point analysis identifies the time point exhibiting a significant
change in neuronal activity relative to the preceding time points. Change points are 
graphically represented by a change in the slope of a plot showing the cumulative 
sums of the firing frequency averaged over the 60 light trials (Extended Data Fig. 3e). 
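
The cumulative-sum idea behind this analysis can be sketched as follows. This is a minimal illustration, not the Change Point Analyzer implementation: the function names and firing rates are hypothetical, and the significance test that the commercial software applies to each candidate change point is omitted.

```python
def cusum(values):
    """Cumulative sum of deviations from the overall mean."""
    mean = sum(values) / len(values)
    sums, running = [], 0.0
    for v in values:
        running += v - mean
        sums.append(running)
    return sums


def estimate_change_point(values):
    """Index of the first bin after the change.

    The cumulative sum changes slope at the change point, so the bin with
    the largest absolute cumulative sum is taken as the last bin before it.
    """
    sums = cusum(values)
    last_before = max(range(len(sums)), key=lambda i: abs(sums[i]))
    return last_before + 1


# Hypothetical trial-averaged firing rates (Hz) per time bin; light-induced
# inhibition starts at bin index 5.
rates = [8, 9, 8, 10, 9, 1, 0, 1, 0, 1]
change_bin = estimate_change_point(rates)
```

Multiplying `change_bin` by the bin width would give an onset latency for the inhibition under these assumptions.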

To ensure that light-evoked and spontaneous spikes of identified interneurons 
were indeed originating from the same neuron, we calculated linear correlation 
(r) values for spontaneous and light-evoked spikes to quantitatively determine the 
similarity of their waveform shapes. Only groups of spikes with r > 0.95 were
considered as originating from the same cell (Extended Data Fig. 3d, j). 
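
The waveform comparison can be sketched as a Pearson correlation between two average waveforms. The waveform samples below are hypothetical, and how waveforms were averaged and aligned before correlation is not specified here, so this is only an illustration of the r > 0.95 criterion.

```python
import math


def pearson_r(a, b):
    """Linear (Pearson) correlation between two equally long waveforms."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


def same_unit(spontaneous, light_evoked, threshold=0.95):
    """Apply the r > 0.95 criterion described in the text."""
    return pearson_r(spontaneous, light_evoked) > threshold


# Hypothetical average waveforms (arbitrary units).
spontaneous = [0.0, -1.0, -4.0, -2.0, 1.0, 2.0, 1.0, 0.0]
light_evoked = [0.9 * v for v in spontaneous]   # same shape, smaller amplitude
other_unit = [-v for v in spontaneous]          # inverted shape
```

Because the correlation is insensitive to amplitude scaling, a light-evoked spike that is slightly smaller than the spontaneous one still passes the criterion.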

Behaviour and optogenetic manipulation of behaviour. Before behavioural
training, light-responsive interneurons were identified in opsin-expressing PV-Cre
or SOM-Cre animals, as described above (Fig. 2b-d and Extended Data Fig. 3).
Depending on the presence of identified interneurons, mice were submitted to
different auditory fear-conditioning paradigms. To determine the physiological
activity of identified interneurons during fear learning, we used a discriminative
fear-conditioning paradigm without optogenetic manipulations. On training day 1
(habituation), animals were presented with 4 repetitions of a CS+ and a CS− (total
CS duration of 30 s, consisting of 50-ms pips repeated at 0.9 Hz, 2-ms rise and fall;
pip frequency: 7.5 kHz or white noise (WN), counterbalanced for randomization,
75 dB sound pressure level). On day 2 (fear conditioning) animals were conditioned
by pairing the CS+ with the US (1-s footshock, 0.75 mA DC, 5 CS/US pairings;
inter-trial interval: 20-180 s; US applied at the time of next expected pip occurrence,
~1.1 s after last pip). The CS− was intermingled with the CS+, but never
reinforced by a US. During fear retrieval on the next day, 4 repetitions of first the
CS− and then of the CS+ were presented to test for fear learning. Single-unit activity
was recorded during all behavioural sessions. Light responsiveness of the identified
units was tested before habituation and after the last behavioural session.

Animals without identified interneurons were subjected to a fear conditioning 
paradigm combined with optogenetic manipulations. In animals with implanted 
optrodes, single-unit activity of non-light-responsive units was recorded during 
all behavioural sessions. On day 1 (habituation), two different CSs (CS1 and CS2) 
were presented 4 times to the animal (total CS duration of 10 s, consisting of
50-ms pips repeated at 0.9 Hz, 2-ms rise and fall; pip frequency: 3 kHz or 12 kHz, 
counterbalanced for randomization, 75 dB sound pressure level). In addition, 
animals were presented with 4 repetitions of the light stimuli, which were used 
in the subsequent conditioning, without a CS. These stimulations were also used 
to determine baseline effects on freezing and locomotion. On day 2 (fear con- 
ditioning), animals were fear conditioned by pairing both CS1 and CS2 with a US 
(1-s foot shock, 0.75 mA DC, 1 CS-US pairing each, interval between CSs: 180 s; 
US applied at the time of next expected pip occurrence, ~1.1 s after last pip).
While one of the two CSs served as an internal control, the other CS was paired 
with light stimulation. Light was applied either only during the CS (blue light: 
300-ms pulses starting 50 ms before each pip; yellow light: continuous illumination
for 10 s starting with the onset of first CS pip), only during the US (blue or
yellow light: 1-s continuous illumination, coinciding with the US), or during CS 
and US (combination of the described stimulation patterns). The order of CS1 and 
CS2 and the pairing with light stimulation were counterbalanced within behav- 
ioural groups. On day 3 (retrieval), animals were tested for fear memory retrieval. 
Both CS1 and CS2 were presented 4 times each without light stimulation or rein- 
forcement. Freezing induced by the control CS and the light-paired CS was compared 
to determine the effect of the optogenetic manipulation in each animal (Extended 
Data Fig. 2). The order of CS1 and CS2 during the test was counterbalanced within 
behavioural groups. Because the behavioural effects of optogenetic manipulations 
could not be pre-specified before the experiments, we chose sample sizes with enough 
repetitions to allow for identification of outliers and for validation of experimental 
reproducibility (n = 7-11). 

To test for the effects of optogenetic manipulations of interneuron activity on 
footshock- or tone-induced activity in the BLA network, we performed additional 
behavioural sessions in animals with implanted optrodes. We analysed footshock 
and tone responses in unidentified cells. This population consists mostly of principal
cells with a minority of interneurons. To exclude fast-spiking interneurons,
we restricted the analysis to cells with a baseline firing rate <3 Hz. To
determine effects on footshock-induced activity, we delivered 20 footshocks (1 s, 
0.75 mA DC) to the animals, alternately with and without coincident stimulation 
with blue or yellow light. To study the effects of interneuron activity manipula- 
tions on tone processing, we used 8 presentations of tones (white noise, 30 s total 
duration, consisting of 50-ms pips repeated at 0.9 Hz, 75 dB sound pressure level) 
alternately with and without coincident light stimulation. For activation of ChR2, 
we used a train of blue light stimuli (300-ms pulses starting 50 ms before each pip) 
and for ARCH we used continuous illumination with yellow light (30 s), as for the 
manipulations during fear learning (see above). For the analysis of the light- 
induced changes in shock or tone responses see the corresponding sections below. 
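
The stimulus timing implied by these parameters can be worked through in a few lines. This is a sketch under the assumption that the first pip starts at CS onset (t = 0); the function names are illustrative and not part of any acquisition software.

```python
def pip_onsets(cs_duration_s, pip_rate_hz=0.9):
    """Onset times (s) of the pips that make up one CS."""
    period = 1.0 / pip_rate_hz
    n_pips = int(round(cs_duration_s * pip_rate_hz))
    return [i * period for i in range(n_pips)]


def blue_light_onsets(pips, lead_s=0.05):
    """Blue-light pulses start 50 ms before each pip."""
    return [t - lead_s for t in pips]


def us_onset(pips, pip_rate_hz=0.9):
    """The US falls where the next pip would have occurred."""
    return pips[-1] + 1.0 / pip_rate_hz


tone_pips = pip_onsets(30.0)      # 30-s CS
short_pips = pip_onsets(10.0)     # 10-s CS used with optogenetic manipulation
shock_time = us_onset(short_pips)
```

A 30-s CS at 0.9 Hz yields 27 pips and a 10-s CS yields 9, with one pip period (about 1.1 s) between the last pip and the US.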

Fear conditioning was performed in a context (context A) different from that 
used for all other behavioural sessions (context B). Context A was cleaned with 
70% ethanol and context B with 1% acetic acid. Freezing behaviour was quantified 
using an automatic infrared beam detection system (Coulbourn Instruments) as 
described previously. The animals were considered freezing if no movement was
detected for 2 s.
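
The 2-s immobility criterion can be illustrated with a short sketch. It assumes that the detection system reports timestamps of movement events and that an entire movement-free gap of at least 2 s counts as freezing; the actual scoring rules of the commercial system may differ, and the event times are hypothetical.

```python
def freezing_time(movement_times, session_end, min_gap=2.0, session_start=0.0):
    """Total time (s) spent in movement-free gaps of at least min_gap seconds."""
    edges = [session_start] + sorted(movement_times) + [session_end]
    gaps = (later - earlier for earlier, later in zip(edges, edges[1:]))
    return sum(gap for gap in gaps if gap >= min_gap)


# Hypothetical movement events (s) during a 12-s observation window.
events = [0.5, 1.0, 4.0, 4.5, 10.0]
frozen = freezing_time(events, session_end=12.0)
percent_freezing = 100.0 * frozen / 12.0
```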

In vivo electrophysiology and analysis. Single-unit spike sorting was performed 
using Off-Line Spike Sorter (OFSS, Plexon) as previously described. In summary,
principal component scores were calculated for unsorted waveforms and plotted 
on three-dimensional principal component spaces. Clusters containing similar wave- 
forms were manually defined (Extended Data Fig. 1b-e). A group of waveforms 
was considered to be originating from a single neuron if it defined a discrete cluster 
in principal component space that was distinct from clusters for other units (deter- 
mined by MANOVA) and if it displayed a clear refractory period (>1 ms) in the 
auto-correlogram. In addition, we used two parameters to quantify the overall sepa- 
ration between identified clusters in an individual channel. The J3 statistic corre- 
sponds to the ratio of between-cluster to within-cluster scatter, and the Davies-Bouldin 
validity index (DB) reflects the ratio of the sum of within-cluster scatter to between-
cluster separation. To obtain control values for these statistics, two clusters from 
the central cloud of points in the principal component space from channels with- 
out detectable units were artificially defined. High values for the J3 and low values 
for the DB compared to the control values indicate good single-unit isolation (Extended
Data Fig. 1c). Template waveforms were then calculated for well-separated
clusters and stored for further analysis. Clusters of identified neurons were ana- 
lysed offline for each recording session using principal component analysis and a 
template-matching algorithm. Only stable clusters of single units recorded over the 
time course of the entire behavioural training were considered. Long-term single- 
unit stability was evaluated using Wavetracker software (Plexon) to calculate prin- 
cipal component space-cylinders from a 2-min recording of spontaneous activity 
before each training session. Straight cylinders suggest stability of the unit isolation 
during all behavioural sessions (Extended Data Fig. 1d). In addition, we determined 
single-unit stability by quantitative evaluation of the similarity of waveform shapes 
by calculating linear correlation (r) values between average waveforms obtained 
over training days. As a control, we computed the r values from average waveforms
of different neurons (Extended Data Fig. 1e).
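
The two cluster-separation statistics can be sketched as below. The formulas are common textbook formulations (scatter ratio for J3, mean of the worst pairwise scatter-to-separation ratio for DB) and are assumed rather than taken from the sorting software; the points stand in for principal component scores and are hypothetical.

```python
import math


def centroid(points):
    n = len(points)
    return tuple(sum(p[d] for p in points) / n for d in range(len(points[0])))


def dist(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def j3(clusters):
    """Between-cluster scatter divided by within-cluster scatter."""
    cents = [centroid(c) for c in clusters]
    grand = centroid([p for c in clusters for p in c])
    within = sum(dist(p, m) ** 2 for c, m in zip(clusters, cents) for p in c)
    between = sum(len(c) * dist(m, grand) ** 2 for c, m in zip(clusters, cents))
    return between / within


def davies_bouldin(clusters):
    """Mean over clusters of the worst (scatter_i + scatter_j) / separation_ij."""
    cents = [centroid(c) for c in clusters]
    scatter = [sum(dist(p, m) for p in c) / len(c) for c, m in zip(clusters, cents)]
    worst = [
        max((scatter[i] + scatter[j]) / dist(cents[i], cents[j])
            for j in range(len(clusters)) if j != i)
        for i in range(len(clusters))
    ]
    return sum(worst) / len(worst)


# Hypothetical two-dimensional principal component scores.
well_separated = [[(0, 0), (0, 1), (1, 0), (1, 1)],
                  [(10, 10), (10, 11), (11, 10), (11, 11)]]
poorly_separated = [[(0, 0), (0, 1), (1, 0), (1, 1)],
                    [(1, 1), (1, 2), (2, 1), (2, 2)]]
```

With these toy clusters, the well-separated pair gives a higher J3 and a lower DB than the poorly separated pair, which is the direction described in the text.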

To exclude multiple recordings of the same neuron on different channels, we 
computed cross-correlation histograms. If a target neuron presented a peak of activity
at the time when the reference neuron fired, only one of the two neurons was con- 
sidered for further analysis. 
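
A cross-correlation histogram of this kind can be sketched as follows. The window, bin width and the peak criterion (`factor`) are illustrative assumptions, since the text does not state how a zero-lag peak was judged; spike times are hypothetical and given in milliseconds.

```python
import math


def cross_correlogram(ref_ms, target_ms, window_ms=5.0, bin_ms=1.0):
    """Histogram of target spike lags relative to each reference spike."""
    n_bins = int(round(2 * window_ms / bin_ms))
    counts = [0] * n_bins
    for r in ref_ms:
        for t in target_ms:
            lag = t - r
            if -window_ms <= lag < window_ms:
                index = min(int(math.floor((lag + window_ms) / bin_ms)), n_bins - 1)
                counts[index] += 1
    return counts


def has_zero_lag_peak(counts, factor=5.0):
    """Hypothetical criterion: central bins far exceed the flanking bins."""
    mid = len(counts) // 2
    centre = (counts[mid - 1] + counts[mid]) / 2.0
    flanks = [c for i, c in enumerate(counts) if i not in (mid - 1, mid)]
    return centre > 0 and centre > factor * (sum(flanks) / len(flanks))


reference = [10.0, 50.0, 90.0, 130.0]
duplicate = [t + 0.2 for t in reference]    # same neuron seen on a second channel
independent = [30.0, 70.0, 110.0]           # unrelated spike train
```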

Neuronal activity is reported either as frequency or as Z-score value. Z-score 
values were calculated by generating a peri-stimulus time histogram (PSTH) of 
the binned neuronal data and summation of all spikes of all cells in each individual 
bin. The average baseline count preceding stimulus onset was subtracted from this 
population spike count. This difference was divided by the baseline standard devi- 
ation to obtain the Z score. 
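
The population Z-score described above can be sketched as follows. The spike counts are hypothetical, and the use of the population standard deviation (rather than the sample standard deviation) of the baseline bins is an assumption.

```python
import statistics


def population_z(psth_per_cell, n_baseline_bins):
    """Z-score of the summed spike counts, relative to the pre-stimulus bins."""
    population = [sum(bins) for bins in zip(*psth_per_cell)]
    baseline = population[:n_baseline_bins]
    mu = statistics.mean(baseline)
    sd = statistics.pstdev(baseline)
    return [(count - mu) / sd for count in population]


# Hypothetical binned spike counts for two cells; the stimulus starts at bin 4.
cells = [[2, 0, 1, 1, 6, 1],
         [0, 1, 1, 0, 4, 0]]
z = population_z(cells, n_baseline_bins=4)
```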

To analyse bursting activity we used the Poisson surprise method. Bursts were
compared between baseline activity and illumination periods (blue or yellow light; 
1-s illumination and 1-s baseline before light onset). Briefly, this method detects 
bursts by finding at least 3 spikes with consecutive interspike intervals (ISI) shorter 
than half the mean ISI, and calculates the ‘unlikeliness’ or ‘surprise’ that a given ISI 
would be expected if the spike train was generated by a Poisson process. The Pois- 
son surprise method is a rigorous detector of bursts, because it is insensitive to fluc- 
tuations in average firing rate. We chose a surprise value of 3, corresponding to a 
probability of 0.05 of finding a burst of spikes within a random sequence. 
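
The surprise value can be illustrated with a short calculation. The sketch assumes the standard formulation, S = −ln P, where P is the probability that a Poisson process with the unit's mean rate produces at least n spikes in the burst interval; a natural logarithm is consistent with a surprise of 3 corresponding to a probability of about 0.05. The example numbers are hypothetical, and the burst search itself is not shown.

```python
import math


def poisson_surprise(n_spikes, duration_s, mean_rate_hz):
    """S = -ln P(at least n_spikes in duration_s | Poisson with mean_rate_hz)."""
    expected = mean_rate_hz * duration_s
    p_fewer = sum(math.exp(-expected) * expected ** k / math.factorial(k)
                  for k in range(n_spikes))
    return -math.log(1.0 - p_fewer)


def surprise_to_probability(surprise):
    """Probability that corresponds to a given surprise value."""
    return math.exp(-surprise)


# Three spikes within 100 ms from a unit firing at 2 Hz on average.
burst_like = poisson_surprise(3, 0.1, 2.0)
# Three spikes within 1 s from a unit firing at 3 Hz on average.
not_burst_like = poisson_surprise(3, 1.0, 3.0)
```

Under these assumptions the first example exceeds the threshold of 3 and the second does not.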

Analysis of optogenetic modulation of shock responses in putative principal
neurons. Shock responses and their modulation by optogenetic activation or
silencing of PV+ cells were studied in putative principal neurons as described above.
Analysis was restricted to putative principal neurons, defined as units whose mean
firing rate during a 100-s baseline period was <3 Hz (ref. 36). A peri-stimulus time
histogram for the shock stimulus, $\mathrm{PSTH}^{S}(t)$, was calculated with 50-ms resolution
between t = −10 s and t = 2 s relative to shock onset for each cell. From this the mean
($\mu_{B}^{S}$) and standard deviation ($\sigma_{B}^{S}$) of the firing rate preceding each shock (t = −10 s
to t = 0 s) were calculated and used to generate the Z-score normalized shock-triggered
PSTH:

$$Z^{S}(t) = \frac{\mathrm{PSTH}^{S}(t) - \mu_{B}^{S}}{\sigma_{B}^{S}}$$


Units were defined as shock responsive (S) if the response in one of the 20 time bins 
between t = 0s and t = 1s exceeded a Z value of 3.72 (corresponding to P < 0.001, 
corrected for multiple comparisons). Analysis was performed for the shock res- 
ponses in the presence of light (SL) in the same way and all units showing excitatory 
shock responses in at least one of the two conditions were pooled. Response 
magnitudes for the S and SL conditions were defined as: 


$$R^{S} = \overline{Z^{S}(t)}\,\Big|_{0-1\,\mathrm{s}}, \qquad R^{SL} = \overline{Z^{SL}(t)}\,\Big|_{0-1\,\mathrm{s}}$$


correspondingly. To quantify the effect of light on the shock responses, a change 
index (CI) was calculated for each unit: 

$$\mathrm{CI} = \frac{R^{SL} - R^{S}}{R^{SL} + R^{S}}$$

For purely excitatory responses, CI values can range between +1 (units that respond
to shock only in the presence of light) and −1 (units that respond to shock
only in the absence of light). CI values outside the range (−1, 1) indicate inhibitory
responses to one of the conditions. A value of 0 indicates no effect of light. For both
animal groups (PV-ChR2, PV-ARCH), several measures were used to quantify the
effect of light on shock responses at the population level. First, the average differences
of the responses ($R^{SL} - R^{S}$) were calculated (Fig. 2g, left). Next, the median CI
for the shock responsive units in each group was calculated (Fig. 2g, right). Finally,
the fraction of S only, SL only, and S and SL units from all responsive units was also
calculated for both animal groups (Fig. 2h). To obtain the population activity of all
shock-excited units, all spikes of all excited units were summed in each individual
bin of the PSTH. From this the mean and standard deviation of the population baseline
(t = −10 s to t = 0 s) were determined and used to generate the Z-score normalized
shock-triggered PSTH for the population of shock-excited units (Fig. 2f).
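
The per-unit shock analysis can be sketched end to end as follows. This is an illustration under stated assumptions: the response magnitude is taken as the mean Z value over the 0-1 s window, the baseline standard deviation is the population standard deviation of the baseline bins, and the PSTH values are hypothetical. With 50-ms bins, the 10-s baseline spans 200 bins and the 1-s response window spans 20 bins.

```python
import statistics


def z_psth(psth, n_baseline_bins):
    """Z-score a PSTH against its pre-stimulus bins."""
    baseline = psth[:n_baseline_bins]
    mu, sd = statistics.mean(baseline), statistics.pstdev(baseline)
    return [(x - mu) / sd for x in psth]


def is_responsive(z, start, stop, threshold=3.72):
    """Responsive if any bin in the response window exceeds the threshold."""
    return any(value > threshold for value in z[start:stop])


def response_magnitude(z, start, stop):
    """Assumed definition: mean Z over the response window."""
    return statistics.mean(z[start:stop])


def change_index(r_light, r_control):
    """CI = (R_SL - R_S) / (R_SL + R_S)."""
    return (r_light - r_control) / (r_light + r_control)


# Hypothetical firing rates (Hz): 200 baseline bins, 20 shock bins, 20 post-shock bins.
psth_shock = [1.0, 3.0] * 100 + [12.0] * 20 + [2.0] * 20
z_shock = z_psth(psth_shock, n_baseline_bins=200)
r_shock = response_magnitude(z_shock, 200, 220)
```

`change_index(r_light, r_shock)` would then compare this unit's response with its response in the presence of light, computed in the same way from the light trials.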

Analysis of optogenetic modulation of tone responses in putative principal
neurons. Tone responses and their modulation by optogenetic activation or silencing
of PV+ or SOM+ cells were studied in putative principal neurons as described
above. Tones were 50-ms white noise bursts presented at a rate of 0.9 Hz. Each
block contained 27 tone pips, but only the first 9 were analysed, to match the behavioural
paradigm.

Analysis was restricted to putative principal neurons, defined as units whose
mean firing rate during a 100-s baseline period was <3 Hz (ref. 36). A peri-stimulus
time histogram for the tone stimulus, $\mathrm{PSTH}^{T}(t)$, was calculated with 10-ms resolution
between t = −200 ms and t = 300 ms relative to tone onset. The mean ($\mu_{B}^{T}$)
and standard deviation ($\sigma_{B}^{T}$) of the firing rate preceding each tone pip (t = −200 ms
to t = 0 ms) were calculated and used to generate Z-score normalized tone-triggered
PSTH:

$$Z^{T}(t) = \frac{\mathrm{PSTH}^{T}(t) - \mu_{B}^{T}}{\sigma_{B}^{T}}$$

Units were defined as tone responsive (T) if the response in one of the 6 time bins
between t = 0 ms and t = 60 ms exceeded a Z value of 3.59 (corresponding to P < 0.001,
corrected for multiple comparisons). Analysis was similarly performed for the tone
responses in the presence of light. Cells were defined as responsive to tone in the
presence of light (TL) in a similar fashion, the only modification being that for ChR2
stimulation, which was non-continuous and consisted of 300-ms light pulses preceding
the tone by 50 ms, only the time period between t = −50 ms and t = 0 ms
was used for calculating $\mu_{B}^{TL}$ and $\sigma_{B}^{TL}$. This was done to avoid confounding tone
responses with firing rate changes induced indirectly by the light, which could change
the baseline and as a result increase the standard deviation estimate. All units showing
excitatory tone responses in at least one of the two conditions were pooled.
Response magnitudes for the T and TL conditions were defined as: 


$$R^{T} = \overline{Z^{T}(t)}\,\Big|_{0-60\,\mathrm{ms}}, \qquad R^{TL} = \overline{Z^{TL}(t)}\,\Big|_{0-60\,\mathrm{ms}}$$


correspondingly. To quantify the effect of light on the tone responses, a change 
index (CI) was calculated for each unit: 


$$\mathrm{CI} = \frac{R^{TL} - R^{T}}{R^{TL} + R^{T}}$$

For purely excitatory responses, CI values can range between +1 (units that respond
to tone only in the presence of light) and −1 (units that respond to tone only
in the absence of light). CI values outside the range (−1, 1) indicate inhibitory responses
to one of the conditions. A value of 0 indicates no effect of light. For each
animal group (PV-ChR2, PV-ARCH, SOM-ChR2, SOM-ARCH), several mea- 
sures were used to quantify the effect of light on tone responses at the population 
level. First, the average differences of the responses ($R^{TL} - R^{T}$) were calculated
(Fig. 5b, left). Next, the median CI for the tone-responsive units in each group was 
calculated (Fig. 5b, right). Finally, the fraction of T only, TL only, and T and TL units 
from all responsive units was also calculated for each animal group (Fig. 5c). To 
obtain the population activity of all tone-excited units, all spikes of all excited units 
were summed in each individual bin of the PSTH. From this the mean and standard 
deviation of the population baseline (t = —200 ms to t = 0 ms) were determined and 
used to generate the Z-score normalized tone-triggered PSTH for the population of 
tone-excited units for each animal group (Fig. 5a). 
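
One way to arrive at a Z threshold of this kind is a one-sided Bonferroni correction across the tested bins. The sketch below is an assumption about the procedure, not a statement of what was done: with α = 0.001 and 6 bins it gives a threshold close to the stated 3.59, but the same assumption with 20 bins gives about 3.89 rather than the 3.72 stated for shock responses, so the exact correction used may differ.

```python
from statistics import NormalDist


def z_threshold(alpha, n_bins):
    """One-sided Z threshold after Bonferroni correction across n_bins tests."""
    return NormalDist().inv_cdf(1.0 - alpha / n_bins)


tone_threshold = z_threshold(0.001, 6)     # 6 bins of 10 ms between 0 and 60 ms
shock_threshold = z_threshold(0.001, 20)   # 20 bins of 50 ms between 0 and 1 s
```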

Slice electrophysiology. Coronal brain slices were prepared from 3-6-month-old
PV-Cre::GAD-EGFP or SOM-Cre::GAD-EGFP mice at least 4 weeks after virus
injection. Animals were anaesthetized with isoflurane and decapitated in ice-cold
artificial cerebrospinal fluid (ACSF in mM: NaCl 124, NaH2PO4 1.25, KCl 2.7, NaHCO3
26, ascorbic acid 2.25, CaCl2 2, MgCl2 8.7, glucose 18). 300-µm-thick amygdala
slices were cut with a sapphire blade (vibrating microtome: Microm HM 650 V),
transferred to an interface chamber and incubated for 45 min at 37 °C in ACSF
(1.3 mM MgCl2). Slices were next transferred to a recording chamber on an upright
microscope (Olympus, BX61W1) and superfused with ACSF (as above, except for
1.3 mM MgCl2, 2.5 mM CaCl2, 10 µM CNQX, 10 µM CPP, 35 ± 1 °C). Interneurons
were identified as EGFP-positive, mCherry-negative cells using epifluorescence
(Lumen Dynamics, X-Cite 120 Hg-Lamp, Q-Imaging, RETIGA EXi camera, Extended
Data Fig. 7a). Electrophysiological recordings from basolateral amygdala
interneurons and pyramidal cells were acquired in whole-cell mode using 3-5 MΩ
glass pipettes. Internal solutions were as follows: voltage clamp (in mM): CsCl 110,
K-gluconate 30, EGTA 1.1, HEPES 10, CaCl2 0.1, Mg-ATP 4, Na-GTP 0.3, 0.4%
biocytin, QX 314 chloride 4; current clamp (in mM): K-methylsulphate 130, KCl 5,
HEPES 10, EGTA 0.6, Na-phosphocreatine 10, Mg-ATP 4, Na-GTP 0.3, 0.4%
biocytin. ChR2-expressing SOM+ or PV+ interneurons were optogenetically stimulated
with a blue light LED (465 nm, Plexon, LED-driver LD-1) or a blue light laser
(473 nm, see above) coupled to an optical fibre, which was positioned above the slice 
(stimulation intensities: 10-20 mW). Importantly, connectivity experiments (Fig. 4a 
and Extended Data Fig. 7) were designed to provide positive evidence for a PV-SOM 
connection, but for technical reasons did not allow us to determine precise connectivity
rates. The experimental approach we have chosen essentially excludes the detec- 
tion of false-positive cells, and therefore provides strong evidence for the existence 
of this connection. In contrast, it is likely that false-negative results have lowered 
the actual number of SOM+ cells receiving PV+ cell input in our experiments. Because
IPSCs were measured in the whole-cell configuration, wash-out of the antigen SOM
during the recording is likely to prevent correct post-hoc identification of SOM+
cells in some cases. Consistent with this, we see an under-representation of SOM+
interneurons in our sample, compared to previous studies. Therefore, based on
our experiments, we cannot determine the actual connectivity rate between PV+
and SOM+ interneurons, but rather obtain a solid lower bound for this connectivity.

Excitatory postsynaptic potentials (EPSPs) were elicited with a bipolar stimulation
electrode (twisted platinum/iridium wire) placed in the slice medial to the
LA to stimulate thalamic afferents. This approach allowed us to partially mimic 
the in vivo situation and to determine the impact of the population of SOM* 
interneurons on the integration of thalamic inputs and CS processing in principal 
neurons. For spike probability experiments, stimulus intensity was adjusted to 
yield EPSPs close to spike threshold, such that control EPSP-evoked spike prob- 
ability was less than 1. Current clamp recordings for synaptically stimulated action 
potentials were performed without excitatory neurotransmitter blockers and Vm
was set at −60 mV via somatic current injection. Data were acquired with a Multiclamp
700B amplifier, Digidata 1322A A/D converter and pClamp 9 software (all
Axon instruments) at 50 kHz and filtered at 4 or 10 kHz in voltage or current- 
clamp mode, respectively. Series resistance (Rs) was monitored continuously and 
experiments were typically discarded when Rs exceeded 30 MΩ. Outside-out patches
were pulled at the end of the experiment and cells were fixed for 1 h in 4% paraformaldehyde
before immunohistochemistry.

Determination of the subcellular distribution of synaptic contacts. To determine
the distribution of synaptic contacts made by SOM+ or PV+ interneurons
on different subcellular compartments of principal neurons, we injected a rAAV
for the conditional expression of GFP, coupled to the presynaptic marker synaptophysin,
into the BLA of 3-month-old SOM-Cre or PV-Cre animals, respectively.
Two to four weeks after virus injection, coronal brain slices for slice electrophy- 
siology were prepared as explained above. Using the voltage-clamp internal solution, 
pyramidal neurons in the basolateral amygdala (one cell per slice) were filled with 
biocytin for 10-15 min before pulling an outside-out patch. Brain slices were fixed 
for 1 h in 4% paraformaldehyde and stored in PBS. Incubation with streptavidin- 
conjugated Alexa 568 (1:1,000, Invitrogen) for visualization of the biocytin-filled 
pyramidal neuron was done overnight at 4°C. No staining for synaptophysin- 
coupled GFP was performed because native fluorescence was sufficient. 

Imaging was carried out using a confocal laser scanning microscope (LSM700, 
Zeiss) equipped with 488 nm and 555 nm laser diodes. Overview images of biocytin- 
filled neurons were acquired with a 20× objective (2.5 µm z-sections). Soma and
dendritic branches were scanned with a 63× objective (Plan-Apochromat 63×/1.40
Oil DIC, Zeiss; image size 1,012 × 1,024 pixels, pinhole 1 Airy unit, 2.3-fold digital
zoom, 130 nm z-interval). Sections from both apical and basal dendrites were chosen
at a distance of 50 µm, 100 µm and 200 µm from the soma.

Deconvolution of 3D images was performed using Huygens Software (Scien- 
tific Volume Imaging). Contact points between presynaptic boutons as defined by 
synaptophysin-GFP expression and postsynaptic principal neuron compartments 
were identified in deconvolved 3D images and manually counted using Imaris 
Software (Bitplane AG). The length of individual dendritic sections was determined 
in 3D using the TREES toolbox (http://www.treestoolbox.org, ref. 57) in Matlab 
(Mathworks). Total number of dendritic contacts from SOM+ and PV+ interneurons
was extrapolated assuming a total length of the dendritic tree of 2,000 µm.
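
The extrapolation amounts to scaling a measured contact density to the assumed total dendritic length. The sketch below uses hypothetical counts; how densities from the different sampled sections were pooled before scaling is not specified here.

```python
def extrapolate_contacts(contacts_counted, length_sampled_um, total_length_um=2000.0):
    """Scale the contact density of the sampled dendrite to the whole tree."""
    density_per_um = contacts_counted / length_sampled_um
    return density_per_um * total_length_um


# Hypothetical example: 12 contacts counted along 150 µm of sampled dendrite.
estimated_total = extrapolate_contacts(12, 150.0)
```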

Histological verification of optrode placement and virus expression. After completion
of experiments, mice were deeply anaesthetized with avertin (0.3 g kg⁻¹)
and an electrolytic lesion was made at the electrode tip by applying 0.1 1A for 15s 
to two of the electrode wires. Mice were then transcardially perfused with phos- 
phate-buffered saline (PBS) followed by freshly made, ice-cold 4% paraformalde- 
hyde (PFA). Brains were extracted and kept in PFA overnight for post-fixation. 
Coronal, 60-µm-thick brain slices were then cut with a vibratome (VT1000 S, Leica)
and stored in PBS containing 0.05% sodium azide. To visualize virus expression, 
standard immunostaining procedures were performed on free-floating brain sections: 
overnight incubation at 4 °C with chicken anti-GFP antibody (1:1,000, catalogue no.
A10262, Invitrogen), together with mouse anti-NeuN (MAB377, Invitrogen), 2 h
incubation with anti-chicken Alexa 488 and anti-mouse Alexa 594 antibodies,
as well as 5 min exposure to 4′,6-diamidino-2-phenylindole (DAPI, 1:10,000) as a
counterstain. For PV and SOM staining, mice were transcardially perfused with 4% 
PFA additionally containing 1.5% picric acid and 0.05% glutaraldehyde, and post- 
fixation was omitted. Before application of the primary antibody, slices were 
incubated for 30 min at room temperature with iFX-enhancer (Invitrogen) to achieve 
a better signal-to-noise ratio. Slices were then incubated at 4 °C for 48 h with either rat 
anti-SOM antibody (1:500, catalogue no. MAB354, Millipore) or anti-PV antibody 
raised in guinea-pig (1:500, catalogue no. 195004, Synaptic Systems), followed by 
overnight incubation at 4 °C with secondary anti-rat or anti-guinea-pig Alexa 647 
antibodies (1:1,000, both Invitrogen). Slices were mounted on gelatin-coated glass 
slides, coverslipped and imaged using a stereomicroscope (Leica) with a mono- 
chrome camera (Qimaging) and LED (Lumencor) light source. Using the merged 
epifluorescence images of virus expression and counterstain as well as the reflected 
light image of the same sections, we determined viral expression as well as place- 
ment of the electrode and optical fibres by assessing the electrolytic and mechanical 
lesion produced by the electrode tip or fibre, respectively. Mice were included into 
the analysis if they showed virus expression bilaterally within BLA, the electrode tip 
was placed inside of the BLA and fibre tip placement was not more than approximately
400 µm away from BLA. Inclusion of animals was done blind to the behavioural
group and the behavioural results.

Histological verification of specificity of viral expression. To test specificity of 
viral delivery of opsins, we quantified co-localization of the virally transduced opsin 
tag with the corresponding peptide antigen visualized by immunohistochemistry 
(IHC). To this end, we stained 60-µm-thick brain sections from perfused and picric
acid/PFA-fixed brains for GFP (chicken anti-GFP, A10262, Invitrogen, 1:1,000) and
either PV (guinea-pig anti-PV, 195004, Synaptic Systems, 1:500) or SOM (rat anti-SOM,
MAB354, Millipore, 1:500) using protocols as described above. Per mouse
(n = 2 for each genotype, PV and SOM), three slices of the amygdala corresponding
to anterior, medial and posterior regions (n = 6 per genotype), respectively,
of the BLA were scanned using a confocal microscope (×20 objective, z-stacks of
1 µm). Image stacks were 3D-cropped to 400 µm × 400 µm × stack height (21–40 µm)
for anterior amygdala, and 500 µm × 500 µm × stack height (20–48 µm) for
medial and posterior amygdala regions of interest. Within these regions, cells positive
for the viral tag and/or the IHC label were manually counted (Imaris software,
Bitplane AG), cell numbers were normalized and averaged, and co-localization was calculated.

Histological verification of cell identity in patched slices. After completion of
in vitro electrophysiological experiments, 300-µm-thick slices were post-fixed for
1 h in fresh 4% PFA, then washed and stored in PBS. Staining for GFP, PV and SOM
was performed as stated above, and for visualization of the biotin-filled patched
cell, streptavidin conjugated to Alexa 405 (Invitrogen) was added together with the
secondary antibodies. Staining for the mCherry tag of the injected virus was omitted
because the native protein was sufficiently fluorescent. Slices were imaged using a
confocal laser scanning microscope (LSM700, LSM710; Zeiss) to determine quadruple
co-localization of PV or SOM, respectively, with GFP, mCherry and biotin.
Notably, we cannot exclude wash-out of the stained antigens during the whole-cell
patch-clamp recordings. Therefore, our approach, rather than providing a precise
estimate of connectivity rates between PV⁺ and SOM⁺ interneurons, provides
a solid lower bound of those connections.

Statistical analysis. All distributions passed tests for normality (Kolmogorov–Smirnov)
and for equal variance (Levene median), unless noted differently. Statistical
tests were performed for specific experiments as follows.

Figure 1d: effects of optogenetic PV⁺ activation during CS and US on acquisition
of fear memory. Comparison of freezing to two different CSs (internal control
and with PV⁺ cell manipulation during conditioning) in experimental and control
animals (n = 11 and n = 10, respectively). Two-way ANOVA identified a significant
interaction between group and CS: F(1,33) = 4.824, P < 0.05. Post-hoc pairwise
Tukey’s tests revealed significant differences between CS1 and CS2 (P < 0.05),
between CS1 and CS2 within the experimental group (P < 0.01) and between the
control and the experimental group within CS2 (P < 0.05).

Figure 2a: comparison of differences between CS1 (control) and CS2 (with
optogenetic manipulation during conditioning) for PV⁺ cell activation (n = 11)
or inhibition (n = 9) during the US. Two-tailed, unpaired Student’s t-test revealed
a significant difference (P < 0.001). In addition, freezing levels to the two CSs were
compared separately for PV⁺ cell activation and inhibition. Two-tailed, paired
Student’s t-tests revealed significant differences (P < 0.05 for PV⁺ cell activation;
P < 0.001 for PV⁺ cell inhibition).

Figure 2d: comparisons of spike half-width and spontaneous activity between
optogenetically identified PV⁺ cells (n = 36) and putative principal neurons from
the same recordings (n = 142). Owing to a non-normal distribution, the Mann–Whitney
rank sum test was used and revealed significant differences (for both:
P < 0.001).

Figure 2g: left, differences in average Z scores of putative principal neurons
during the footshock with and without optogenetic activation (n = 56) or inhibition
(n = 56) of PV⁺ interneurons. Owing to non-normal distributions, the Mann–Whitney
rank sum test was used and revealed a significant difference between the
effects of PV activation and inhibition (P < 0.001). In addition, the Wilcoxon signed
rank test was used to determine statistical significance of the differences in each
individual group and revealed significant differences between the footshock responses
with and without light for both PV⁺ interneuron manipulations (PV⁺ interneuron
activation P < 0.01; PV⁺ interneuron inhibition P < 0.05). Right, no significant
differences were found for the median change index in putative principal neurons
for PV⁺ interneuron activity manipulations.

Figure 3a: comparison of differences between CS1 (control) and CS2 (with
optogenetic manipulation during conditioning) for PV⁺ cell activation (n = 10)
or inhibition (n = 8) during the CS. Two-tailed, unpaired Student’s t-test revealed
a significant difference (P < 0.01). In addition, freezing levels to the two CSs were
compared separately for PV⁺ cell activation and inhibition. Two-tailed, paired
Student’s t-tests revealed significant differences (P < 0.05 for PV⁺ cell activation
and inhibition).

Figure 3c: comparison of differences between CS1 (control) and CS2 (with optogenetic
manipulation during conditioning) for SOM⁺ cell activation (n = 10) or
inhibition (n = 8) during the CS. Two-tailed, unpaired Student’s t-test revealed a
significant difference (P < 0.001). In addition, freezing levels to the two CSs were
compared separately for SOM⁺ cell activation and inhibition. Two-tailed, paired
Student’s t-tests revealed significant differences (P < 0.01 for SOM⁺ cell activation
and P < 0.05 for SOM⁺ cell inhibition).

Figure 4g: comparisons of EPSP size (n = 9 cells), EPSP decay time (n = 9) and
of spike probability (n = 6) between electrical stimulation and electrical stimulation
with optogenetic activation of SOM⁺ cells. Data were normalized to control
conditions. Two-tailed, paired Student’s t-tests revealed significant differences
(P < 0.05 for spike probability; P < 0.001 for EPSP size and decay time).

Figure 5b: left, comparisons of the differences of average tone responses with
and without optogenetic manipulations between the different animal groups (PV-ChR2,
n = 36 cells; PV-ARCH, n = 18; SOM-ChR2, n = 19; SOM-ARCH, n = 12).
One-way ANOVA indicated significant main effects between animal groups:
F(3,80) = 3.166, P < 0.05. Post-hoc pairwise Tukey’s tests revealed a significant
difference between PV-ChR2 and SOM-ChR2 (P < 0.05). In addition, a two-tailed,
paired Student’s t-test was used to determine statistical significance of the differences
in each individual group and revealed an effect in the SOM-ChR2 group
(P < 0.05). Right, comparisons of the median change index for the tone responses
with and without light between the different animal groups (as in the left panel).
Owing to a non-normal distribution, the Kruskal–Wallis one-way ANOVA on ranks
test was used and revealed significant differences between the different animal groups
(H = 12.975, 3 d.f., P < 0.01). Pairwise multiple comparisons (Dunn’s method) revealed
significant differences between PV-ChR2 and SOM-ChR2 (P < 0.05) and between
SOM-ChR2 and SOM-ARCH (P < 0.05). In addition, the Wilcoxon signed rank
test was used to determine statistical significance of the median CIs for the individual
groups and revealed effects for PV-ChR2 (P < 0.01) and for SOM-ChR2
(P < 0.05).

Extended Data Fig. 1c: comparisons of the J3 and DB values for single-unit cluster
quality between optogenetically identified PV⁺ cells (n = 36) or SOM⁺ cells (n = 20)
and random noise clusters in channels without units (n = 58), respectively. Owing
to a non-normal distribution, the Kruskal–Wallis one-way ANOVA on ranks test
was used and revealed significant differences between the different cell groups, both
for the J3 (H = 86.491, 2 d.f., P < 0.001) and the DB (H = 86.275, 2 d.f., P < 0.001)
values. Pairwise multiple comparisons (Dunn’s method) revealed significant differences
between PV⁺ cells and control and between SOM⁺ cells and control for both
J3 and DB (all P < 0.05).

Extended Data Fig. 1e: linear correlations were calculated for optogenetically
identified PV⁺ cells (n = 36), SOM⁺ cells (n = 20) and between waveforms from
different and random non-light-responsive units (n = 72) over different behavioural
sessions. Plotted are the maximum r values.
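
The waveform-similarity measure can be illustrated with a short sketch. It assumes that each recording session contributes one average waveform per unit and that r is the Pearson correlation computed between every pair of sessions; the text does not specify the exact pairing, and the function names are illustrative.

```python
from itertools import combinations
from math import sqrt


def pearson_r(x, y):
    """Pearson linear correlation between two equally long waveforms."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("waveforms must have the same length (>= 2 samples)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("a flat waveform has no defined correlation")
    return sxy / sqrt(sxx * syy)


def max_session_r(session_waveforms):
    """Maximum r over all pairs of session-average waveforms of one unit."""
    if len(session_waveforms) < 2:
        raise ValueError("need waveforms from at least two sessions")
    return max(pearson_r(a, b) for a, b in combinations(session_waveforms, 2))
```

Since r is insensitive to amplitude scaling, two sessions with the same spike shape but different amplitudes still give r close to 1.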

Extended Data Fig. 2a: comparisons of baseline freezing levels with and without
optogenetic stimulation of PV⁺ or SOM⁺ cells. Two-tailed, paired Student’s
t-tests did not reveal any significant differences between baseline freezing and
stimulation in any of the conditions (P = 0.56 for PV⁺-ChR (n = 31), P = 0.9 for
SOM⁺-ChR (n = 7), P = 0.82 for PV⁺-ARCH (n = 17), P = 0.27 for SOM⁺-ARCH
(n = 6)).


Extended Data Fig. 2b: effects of all performed optogenetic manipulations of
either PV⁺ or SOM⁺ cells on the acquisition of fear. For each condition the internal
control CS1 was compared with the CS2, which was paired with optogenetic stimulation
during conditioning. Two-tailed, paired Student’s t-tests revealed significant
differences for all conditions, except for the CS+US control group (P < 0.001 for
PV⁺-ChR CS+US (n = 11), P = 0.86 for PV⁺-ChR CS+US control (n = 10),
P < 0.05 for PV⁺-ChR US (n = 11), P < 0.001 for PV⁺-ARCH US (n = 9), P < 0.05
for PV⁺-ChR CS (n = 10), P < 0.05 for PV⁺-ARCH CS (n = 8), P < 0.01 for
SOM⁺-ChR CS (n = 10) and P < 0.05 for SOM⁺-ARCH CS (n = 8)).

Extended Data Fig. 4d: comparisons of the spike half-width and the spontaneous
activity of optogenetically identified PV⁺ cells (n = 36), SOM⁺ cells (n = 20)
and non-light-responsive units (n = 142). Owing to a non-normal distribution, a
Kruskal–Wallis one-way ANOVA on ranks was used and indicated statistically
significant differences between the groups for both parameters (spike half-width:
H = 26.347, 2 d.f., P < 0.001; spontaneous activity: H = 34.518, 2 d.f., P < 0.001).
Pairwise multiple comparisons (Dunn’s method) revealed significant differences
for both parameters between PV⁺ and non-light-responsive cells, as well as between
PV⁺ and SOM⁺ cells (all P < 0.05).

Extended Data Fig. 6e: comparison of freezing levels to CS1 (control) and CS2 
(with optogenetic manipulation during conditioning) for principal neuron inhib- 
ition (n = 8) during the CS. Two-tailed, paired Student’s t-tests revealed a signifi- 
cant difference (P < 0.01). 

Extended Data Fig. 8i: comparison of synaptic contacts made by SOM⁺ and
PV⁺ interneurons at principal neuron somata (SOM: n = 8; PV: n = 7). Two-tailed,
paired Student’s t-test (Welch-corrected) revealed a significant difference (P < 0.05).

Extended Data Fig. 8j: comparison of extrapolated synaptic contacts made by
SOM⁺ and PV⁺ interneurons at the estimated whole dendritic tree of principal
neurons (SOM: n = 8; PV: n = 7). Two-tailed, paired Student’s t-test revealed a
significant difference (P < 0.001).

Extended Data Fig. 8k: comparison of synaptic contacts made by SOM⁺ and
PV⁺ interneurons at principal neuron dendrites, normalized to the somatic contacts
(SOM: n = 8; PV: n = 7). Two-tailed, paired Student’s t-test (Welch-corrected)
revealed a significant difference (P < 0.001).

Extended Data Fig. 8l: comparison of synaptic contacts made by SOM⁺ and
PV⁺ interneurons per 50 µm of principal neuron dendrites, depending on the distance
from the soma (SOM: n = 8; PV: n = 7). Two-tailed, paired Student’s t-tests
(Welch-corrected for the 200 µm comparison) revealed significant differences (for
50 µm: P < 0.001; for 100 µm: P < 0.001; for 200 µm: P < 0.01).

Extended Data Fig. 10b: comparisons of the burst rate and the burst duration in
putative principal neurons with and without optogenetic manipulation of PV⁺ or
SOM⁺ cells. Bursts were detected using the Poisson surprise method (see above).
Data were normalized to control conditions without light. Two-tailed, paired Student’s
t-tests revealed significant differences (for burst rate: P < 0.05 for PV-ChR2; P < 0.001
for SOM-ChR2; for burst duration: P < 0.05 for SOM-ARCH; P < 0.001 for PV-ChR2
and SOM-ChR2).


Extended Data Figure 1 | Fibre and electrode placement sites and
single-unit recording quality. a, Position of electrode tips for single-unit
recordings and of bilateral optic fibres for all PV-Cre and SOM-Cre animals
used. b, Left, superimposed waveforms of three single units recorded on the
same channel in the BLA of a PV-Cre animal. Right, spikes from individual
units were sorted into clusters, using 3D principal component analysis.
c, Recording quality was evaluated by calculation of J3 and Davies–Bouldin
validity index (DB) statistics for PV⁺ and SOM⁺ interneurons. To obtain
control values, two clusters were selected in the central noise cloud in channels
without units. High J3 and low DB values indicate good isolation of single units.
d, Left, principal component (PC) space cylinders to determine the stability
of clustered waveforms during long-term recordings. Straight cylinders
indicate stable recordings of the same set of single units. Right, superimposed
waveforms recorded during the different sessions, used for the calculation of
the PC space cylinders. e, For a quantitative evaluation of spike shape similarity
over different recording days, we calculated linear correlation values for
PV⁺ and SOM⁺ interneurons. As a control, we calculated the r values for
waveforms of different random neurons. The maximum r value across
recording sessions was used to quantify similarity (r = 1 indicates identical
spike shapes). All PV⁺ and SOM⁺ interneurons had r values above 0.95,
compared with only about 4% of the random comparisons. Values are mean ± s.e.m.
*P < 0.05. Statistical analysis in Methods.
manipulations of PV⁺ or SOM⁺ cell activity on baseline freezing levels.
Illumination patterns resemble stimulations during the CS in the conditioning
paradigms (10 s of 300-ms blue light pulses at 0.9 Hz or 10 s continuous yellow

Freezing levels are compared between the internal control CS1 and CS2, which
was paired with optogenetic stimulation during the conditioning.
Extended Data Figure 3 | Optogenetic identification and manipulation of
PV⁺ and SOM⁺ interneurons. a, b, Top left: ex vivo whole-cell patch clamp
and cell-attached recordings of ChR2-mCherry-expressing PV⁺ (a) and
SOM⁺ (b) BLA interneurons in amygdala slices. Top right: distinct firing
patterns of PV⁺ (a) and SOM⁺ (b) cells in response to depolarizing somatic
current injection. Bottom: PV⁺ (a) and SOM⁺ (b) cells fire brief bursts of
action potentials in response to short blue light stimulation (468 nm, 5 ms,
10 mW) and show sustained firing during prolonged stimulation (468 nm,
300 ms, 10 mW, cell-attached recordings). c, Single-unit recordings of
optogenetically identified PV⁺ (green) and SOM⁺ (red) interneurons upon
300-ms stimulation with blue (ChR2, top) or yellow (ARCH, bottom) light in
behaving animals. Left: Z-scored activity. Right: firing frequency. d, Left:
comparison of spontaneous and light-evoked superimposed average spike
waveforms of all PV⁺ interneurons identified by optogenetic activation.
Middle: linear correlations between spontaneous and light-evoked spikes were
calculated for individual optogenetically identified PV⁺ interneurons. Only
cells with r values above 0.95 were considered as directly light-activated. Right:
jitter of first light-evoked spikes in identified PV⁺ cells. e, Change point analysis
for determination of the latency of light-evoked activity changes. The
cumulative sum of the activity of a neuron was calculated and the change point
was determined. Shown is an example of ARCH-mediated inhibition for the
entire illumination period (left) and at the light onset (right). The arrow
indicates the change point. f, Expression of ChR2, co-expressed with
Venus (green), is restricted to SOM⁺ interneurons (red). g, Specificity of
opsin expression. Left: examples of opsin expression in a PV-Cre (top) and a
SOM-Cre (bottom) animal with immunohistochemistry (IHC) for the
respective other interneuron marker. Right: quantification of co-localization of
opsin-expressing and IHC-labelled cells. h, Light-induced changes in activity of
SOM⁺ interneurons upon illumination with either blue or yellow light for
activation of ChR2 or ARCH, respectively. i, Latencies and Z scores of
light-induced activity changes in optogenetically identified SOM⁺
interneurons. j, Left: comparison of average waveforms of spontaneous and
light-evoked spikes in SOM⁺ interneurons. Middle: linear correlations between
spontaneous and light-evoked spikes in optogenetically identified SOM⁺
interneurons. Right: jitter of first light-evoked spikes in identified SOM⁺ cells.
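
The change point analysis in panel e can be illustrated with one common cumulative-sum (CUSUM) estimator. This is a sketch under assumptions: the caption only states that the cumulative sum of activity was calculated, so subtracting the mean before summing and taking the extremum of the resulting curve as the change point are choices made here, not necessarily the authors' exact procedure.

```python
def cusum_change_point(rates):
    """Estimate the bin at which the firing rate changes.

    rates: firing rate (or spike count) per time bin.
    Returns (index of the first bin after the change, CUSUM curve).
    """
    n = len(rates)
    if n < 2:
        raise ValueError("need at least two bins")
    mean_rate = sum(rates) / n

    # Cumulative sum of deviations from the overall mean.
    cusum = []
    running = 0.0
    for r in rates:
        running += r - mean_rate
        cusum.append(running)

    # The curve peaks (in absolute value) at the last bin of the old regime.
    last_old = max(range(n), key=lambda i: abs(cusum[i]))
    return last_old + 1, cusum
```

With a known bin width, the latency of the light-evoked change would be the returned index multiplied by that width, measured from the start of the analysed window.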
mainly comprised of principal cells, but could also include unidentified
interneurons. Squares indicate average values for the three groups. SOM⁺

similarity of SOM⁺ interneurons with non-light-responsive cells. Values are
mean ± s.e.m. *P < 0.05. Statistical analysis in Methods.
Extended Data Figure 5 | US and CS responses in PV⁺ and SOM⁺
interneurons in the BLA. a–d, Z-scored population activity (left) and firing
frequency (right) for identified PV⁺ (green) and SOM⁺ (red) cells in response
to the US (a, b) or the CS (c, d). Shown are the population responses for all
cells (top graphs). The graphs below show population responses that were
separated according to whether the cells showed inhibitory or excitatory
responses to the stimuli. Cells were included in the stimulus-inhibited group or
stimulus-excited group if their average Z-scored stimulus response was <0 or
>0, respectively.
Extended Data Figure 6 | Effects of optogenetic manipulations of PV⁺
interneuron activity on footshock responses of putative principal neurons.
a, Footshock-evoked responses in footshock-excited putative principal neurons
with (blue or yellow) and without (black) optogenetic stimulation of PV⁺
interneurons. Cells are split based on being excited either exclusively during the
footshock alone, exclusively during the footshock+light or during both
conditions. b, Cumulative probability distributions of the average Z score of
individual putative principal neurons during the footshock without (black)
and with optogenetic manipulations of PV⁺ interneurons. c, Cumulative
probability distributions of the change index (CI) as a measure for the
light-induced changes in footshock responses for the different optogenetic
interneuron manipulations. The change index is calculated as
CI = (Z score(footshock+light) − Z score(footshock))/(Z score(footshock+light) +
Z score(footshock)) (all Z scores are the averages of the shock response during
the entire footshock). Shown are the distributions for the footshock-excited
cells (black), for the footshock+light-excited cells (blue or yellow), for cells
which are excited in both conditions (green) and the weighted sum (grey).
d, Left, comparisons of cumulative probability distributions for the differences
in average footshock Z score during the entire footshock ((footshock+light) −
footshock) in putative principal neurons for PV⁺ interneuron activation (blue)
and inhibition (yellow). Right, comparisons of cumulative probability
distributions for the change index of the footshock responses of putative
principal neurons upon optogenetic activation (blue) or inhibition (yellow) of
PV⁺ interneurons. e, Inhibition of BLA principal neurons during the US
impairs fear acquisition. Principal neurons were selectively inhibited by
expression of ARCH under the control of the CaMKII promoter and
illumination with yellow light during the US. Values are mean ± s.e.m.
**P < 0.01. Statistical analysis in Methods.
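
The change index defined in panel c can be computed directly from the two average Z scores. The following is a minimal sketch; the function name and the choice to return NaN when the denominator is zero are assumptions, since the caption does not say how that case was handled.

```python
import math


def change_index(z_with_light, z_without_light):
    """CI = (Z(stimulus+light) - Z(stimulus)) / (Z(stimulus+light) + Z(stimulus))."""
    denominator = z_with_light + z_without_light
    if denominator == 0:
        # Assumption: undefined CI is reported as NaN rather than raising.
        return math.nan
    return (z_with_light - z_without_light) / denominator


# Example: a response that grows from Z = 1 to Z = 3 under illumination.
ci_increase = change_index(3.0, 1.0)   # positive CI, light enhanced the response
ci_decrease = change_index(1.0, 3.0)   # negative CI, light reduced the response
```

When both Z scores have the same sign the index stays between −1 and 1; with Z scores of opposite sign it can exceed that range, which is worth keeping in mind when reading the cumulative distributions.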
Extended Data Figure 7 | Inhibitory inputs from PV⁺ to SOM⁺ as well as to
unidentified GABAergic neurons. a, Slice patch-clamp experiments. Left:
BLA location of PV⁺ interneurons expressing ChR2-mCherry in acute slices of
PV-Cre::GAD-EGFP animals. Right: whole-cell recording (middle) of neuron
expressing GAD (top) but not ChR2 (bottom). (Original magnification: left:
×5; right: ×60.) b, Optogenetic stimulation of PV⁺ neurons resulted in
inhibitory postsynaptic currents (IPSCs) in GAD-EGFP⁺ neurons (n = 26,
5-ms blue light). IPSCs reverse at about 0 mV (voltage clamp Cl⁻ reversal
potential, n = 4). c, Left: fractions of patched and identified SOM⁺ cells with
and without detected PV⁺ input. Owing to potential wash-out of the antigen
(SOM) during the whole-cell recordings, these data probably represent a solid
lower bound rather than the actual connectivity rate (see Methods). Right: IPSC
amplitude, peak latency after blue light onset and 10–90% rise time of PV inputs
on all measured GABAergic (PV-GAD, n = 26) and somatostatin-positive
(PV-SOM, n = 3) neurons, respectively. One SOM⁺ cell with PV inputs was
tested with a current-clamp internal solution at −40 mV and thus does not contribute to
the statistics. d, e, Suppression of spikes in a SOM⁺ cell by optogenetic
activation of PV⁺ cells. d, Individual trials of somatic current injection into a
SOM⁺ cell with and without PV⁺ activation. Red traces show spike failures on
PV⁺ stimulation. e, Overlay of current injections with spiking success and
failures without (top) and with (bottom) simultaneous PV⁺ cell activation.
Extended Data Figure 8 | PV⁺ and SOM⁺ interneurons contact different
compartments of pyramidal neurons. a, Maximum intensity projection
(MIP) showing a biocytin-filled principal neuron (grey) and conditional
expression of the presynaptic marker synaptophysin-GFP (Syn-GFP, red) in
SOM⁺ interneurons in the BLA. b, Soma of the same principal neuron (grey)
with expression of synaptophysin-GFP (red) selectively in SOM⁺ interneurons
(MIP). Arrowhead points to 1 out of 7 identified synaptic contacts at the
soma revealed by 3D analysis of confocal images. c, Dendritic branch located
200 µm distant from the soma (MIP, yellow rectangle in a). Arrowheads mark
synaptic contacts of SOM⁺ interneurons with the principal neuron dendrite
identified in 3D analysis. d, Single focal planes of contacts labelled in b, c.
e, Biocytin-filled principal neuron and conditional synaptophysin-GFP
expression (green) in PV⁺ interneurons in the BLA (MIP). f, Soma of the same
principal neuron with presynaptic synaptophysin-GFP expression selectively
in PV⁺ interneurons (MIP). Arrowheads point to 3 out of 53 synaptic contacts
from PV⁺ interneurons identified in 3D analysis at this soma. g, Dendrite
located 200 µm distant from the soma (MIP, yellow rectangle in e). Arrowhead
marks a synaptic contact from PV⁺ interneurons. h, Single section images of
appositions labelled in f, g. Scale bars: 100 µm (a, e), 5 µm (b, f), 2 µm (c, g)
and 0.5 µm (d, h). i, Quantification of synaptic contacts from SOM⁺ and PV⁺
interneurons with the soma of principal neurons in the BLA. j, Number of
extrapolated synaptic appositions from SOM⁺ and PV⁺ interneurons onto the
estimated dendritic tree of principal neurons. k, Normalization of dendritic
versus somatic appositions shows a highly significant difference between
postsynaptic targeting of SOM⁺ and PV⁺ interneurons. l, Distance
dependency of synaptic inputs from SOM⁺ (red, n = 8) and PV⁺ interneurons
(green, n = 7) onto principal neuron dendrites. SOM⁺ interneurons
preferentially target distal dendrites of principal neurons. All values are
mean ± s.e.m. *P < 0.05, **P < 0.01, ***P < 0.001. Statistical analysis
in Methods.
Extended Data Figure 9 | Effects of optogenetic manipulations of PV⁺ or
SOM⁺ interneuron activity on tone-evoked responses in putative principal
neurons. a, Tone-evoked responses in tone-excited putative principal neurons
with (blue or yellow) and without (black) optogenetic stimulation of PV⁺ or
SOM⁺ interneurons. Cells are split based on being tone excited either
exclusively during the tone alone, exclusively during tone+light or in both
conditions. b, Cumulative probability distributions of the change index (CI) as
a measure for the light-induced changes in tone responses for the different
optogenetic interneuron manipulations. The change index is calculated as
CI = (Z score(tone+light) − Z score(tone))/(Z score(tone+light) +
Z score(tone)) (all Z scores are the averages of the tone response during the pip).
Shown are the distributions for the tone-excited cells (black), for the tone+light-excited
cells (blue or yellow), for cells which are excited in both conditions
(green) and the weighted sum (grey). c, Comparisons of cumulative probability
distributions for the change index of the tone responses of putative principal
neurons upon optogenetic stimulation of PV⁺ or SOM⁺ cells.
Extended Data Figure 10 | Effects of optogenetic manipulations of PV⁺ or
SOM⁺ interneurons on spontaneous activity and burst firing of putative
principal neurons. a, Effects of optogenetic activation (top) or inhibition
(bottom) of PV⁺ (grey) or SOM⁺ (black) cells on the spontaneous activity of
putative principal neurons. b, Effects of optogenetic manipulations of PV⁺ or
SOM⁺ cells on the burst frequency (top) and burst duration (bottom) in
putative principal neurons. c, Comparisons of cumulative probability
distributions for light-induced changes in spontaneous activity in putative
principal neurons on optogenetic stimulation of PV⁺ or SOM⁺ interneurons.
d, Comparisons of cumulative probability distributions for light-induced
changes in burst rate in putative principal neurons on optogenetic stimulation
of PV⁺ or SOM⁺ interneurons. Values are mean ± s.e.m. *P < 0.05,
***P < 0.001. Statistical analysis in Methods.
Contrasting forms of cocaine-evoked
plasticity control components of relapse


Vincent Pascoli, Jean Terrier, Julie Espallergues, Emmanuel Valjent, Eoin Cornelius O’Connor & Christian Lüscher


Nucleus accumbens neurons serve to integrate information from cortical and limbic regions to direct behaviour. Addictive
drugs are proposed to hijack this system, enabling drug-associated cues to trigger relapse to drug seeking. However,
the connections affected and proof of causality remain to be established. Here we use a mouse model of delayed cue-associated
cocaine seeking with ex vivo electrophysiology in optogenetically delineated circuits. We find that seeking
correlates with rectifying AMPA (α-amino-3-hydroxy-5-methyl-4-isoxazole propionic acid) receptor transmission and
a reduced AMPA/NMDA (N-methyl-D-aspartate) ratio at medial prefrontal cortex (mPFC) to nucleus accumbens shell
D1-receptor medium-sized spiny neurons (D1R-MSNs). In contrast, the AMPA/NMDA ratio increases at ventral hippocampus
to D1R-MSNs. Optogenetic reversal of cocaine-evoked plasticity at both inputs abolishes seeking, whereas selective
reversal at mPFC or ventral hippocampus synapses impairs response discrimination or reduces response vigour during
seeking, respectively. Taken together, we describe how information integration in the nucleus accumbens is commandeered
by cocaine at discrete synapses to allow relapse. Our approach holds promise for identifying synaptic causalities in
other behavioural disorders.


The nucleus accumbens (NAc) is a point of convergence for excitatory
afferents arising from limbic and cortical regions, including the basolateral
amygdala (BLA), ventral hippocampus (vHipp) and the mPFC.
From pharmacology, lesion and imaging experiments, each of these
regions is thought to signal distinct information to the NAc during
reward-related situations. Specifically, the BLA signals emotional valence,
the vHipp provides contextual relevance, whereas the mPFC provides
action–outcome information. NAc neurons select and integrate information
from these diffuse regions and signal to the basal ganglia motor
system to guide appropriate behaviours, such as foraging in response to
feeding-relevant signals. This neural circuitry is also involved in core
features of drug addiction, such as craving and relapse in response to
drug-associated cues after withdrawal periods. Understanding how
this circuitry is ‘hijacked’ after drug experience and how drug-evoked
alterations are causally related to drug-adaptive behaviours, such as
relapse, is fundamental to defining the pathophysiology of addiction.
The task of information processing and output in the NAc falls to
MSNs, representing 95% of NAc neurons and broadly divisible into
two equally sized classes according to the dopamine receptors expressed.
Both NAc D1 receptor (D1R)-MSN and D2 receptor (D2R)-MSN subtypes
receive excitatory afferents from the BLA, vHipp and mPFC, but
differ in their projection targets. Owing to the arrangement of synapses
on MSN spines, excitatory transmission arriving onto the spine
head is subject to strong modulation by dopamine from the ventral tegmental
area arriving at the spine neck. Indeed, dopamine gates excitatory
transmission, allowing synaptic adaptations and modulation of
reward-related behaviours. Accumbal dopamine transients induced
by addictive drugs are probably key for inducing circuit adaptations
that divert behaviour towards compulsive drug seeking and heighten
the risk of relapse after prolonged withdrawal. Relapse is associated
with exposure to drug cues, is context dependent and requires knowledge
of which actions result in drug delivery. Therefore, this behaviour
is likely underpinned by a memory trace, formed in the reward circuitry
during drug use. However, the nature of this trace and the causal implications
remain elusive.

Much literature has described induction requirements and expression
mechanisms for drug-evoked plasticity at excitatory synapses, including
those of the NAc. The time after the last drug exposure is important,
as the expression of some forms of plasticity can take days or weeks.
Therefore, establishing a causal link between drug-evoked plasticity and
drug-adaptive behaviours has been difficult. Pharmacological manipulation
of AMPA receptors (AMPARs) implicates NAc glutamatergic
transmission in cue-associated cocaine seeking, but many questions
remain. Do different excitatory inputs change in the same way onto D1R-
and D2R-MSNs after withdrawal from cocaine self-administration?
Could drug-evoked alterations at specific inputs be causally linked to
specific components of drug-adaptive behaviours, such as relapse? Here
we use cell-type-specific reporter lines and optogenetics to address these
questions, taking an approach that may also be applied in other behavioural
disorders to establish synaptic causalities (Extended Data Fig. 1).


Cocaine-evoked plasticity at D1R-MSNs


To characterize drug-evoked plasticity in identified NAc MSNs at a time
point when relapse can be observed, bacterial artificial chromosome (BAC) 
transgenic mice expressing fluorescent proteins under the control of 
the D1R or D2R promoter were first trained to self-administer intrave- 
nous cocaine”. Control mice that received only intravenous saline infu- 
sions quickly stopped responding, whereas mice that self-administered 
cocaine learned to discriminate between the active and inactive levers 
and responded almost exclusively on the active lever at the end of the 
acquisition phase, confirming successful acquisition of cocaine self- 
administration (Fig. 1a, b). 

Thirty days after the last self-administration session, whole-cell record- 
ings were made ex vivo in identified NAc MSNs with excitatory post- 
synaptic currents (EPSCs) elicited by electrical stimulation to obtain a 
global overview of cocaine-evoked plasticity (Fig. 1c). The rectification 
index of AMPAR-EPSCs was increased selectively in D1R-MSNs from
mice that self-administered cocaine, suggesting the presence of GluA2- 
lacking calcium-permeable AMPARs (CP-AMPARs”*). The ratio of 
the amplitude of AMPAR- and NMDAR-EPSCs (A/N ratio) recorded 
in the same cells, which provides a measure of synaptic strength, was 
also increased only in D1R-MSNs from mice that self-administered 
cocaine (Fig. 1c). At first glance, it was surprising to observe these two 
forms of plasticity in the same neuron, because CP-AMPARs show reduced 
conductance at +40 mV, which would produce a decreased A/N ratio if
NMDAR-EPSCs were unchanged. One potential explanation is that cocaine
induced several contrasting forms of plasticity that segregate between 
different inputs onto the same D1R-MSNs. In support of this idea, a 
recent study reported an increased A/N ratio specifically at vHipp to 
NAc synapses after chronic non-contingent cocaine injections, with- 
out, however, resolving the MSN identity”. 
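
The apparent conflict between rectification and an increased A/N ratio can be illustrated numerically. The sketch below is not the analysis code used in the study. It assumes that the rectification index is the ratio of AMPAR chord conductances at −70 and +40 mV with a reversal potential of 0 mV (a common convention that the text does not spell out), and all current amplitudes are hypothetical values chosen for illustration.

```python
def rectification_index(i_neg70_pa, i_pos40_pa, e_rev_mv=0.0):
    """Ratio of AMPAR chord conductances at -70 mV and +40 mV.

    Assumed definition (not stated in the text): a linear I/V curve
    gives 1.0, and inward rectification gives values above 1.0.
    """
    g_neg = i_neg70_pa / (-70.0 - e_rev_mv)
    g_pos = i_pos40_pa / (40.0 - e_rev_mv)
    return g_neg / g_pos


def an_ratio(ampar_pos40_pa, mixed_pos40_pa):
    """AMPAR/NMDAR amplitude ratio at +40 mV.

    The NMDAR component is the mixed EPSC minus the isolated AMPAR EPSC.
    """
    nmdar_pos40_pa = mixed_pos40_pa - ampar_pos40_pa
    return ampar_pos40_pa / nmdar_pos40_pa


NMDAR_PA = 60.0  # hypothetical NMDAR-EPSC amplitude, held constant

# Hypothetical linear synapse: outward current scales with driving force.
linear = {"neg70": -70.0, "pos40": 40.0}
# Hypothetical rectifying synapse: same inward current, half the outward current.
rectifying = {"neg70": -70.0, "pos40": 20.0}

for name, s in (("linear", linear), ("rectifying", rectifying)):
    ri = rectification_index(s["neg70"], s["pos40"])
    an = an_ratio(s["pos40"], s["pos40"] + NMDAR_PA)
    print(f"{name}: RI = {ri:.2f}, A/N = {an:.2f}")
```

With the NMDAR component held constant, the rectifying example yields a higher rectification index but a lower A/N ratio than the linear one. This is why finding both an increased rectification index and an increased A/N ratio in the same neuron points to different forms of plasticity at different inputs.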


[Figure 1 graphic. a, raster plots of active lever presses, infusions and inactive lever presses during saline and cocaine self-administration (x axis: time, min; sessions 1-10). b, lever presses and infusions across days 1-10 (FR1, FR2; saline versus cocaine). c, schematic (saline or cocaine self-administration on days 1-10, slice recordings at day 40) and plots of A/N ratio against rectification index for D1R-MSNs and D2R-MSNs.]


Figure 1 | Withdrawal from cocaine self-administration evokes cell-type-
specific modifications of excitatory synapses in the NAc. a, Raster plot for
infusions, active and inactive lever presses as a function of time during
acquisition of self-administration for a mouse that self-administered saline
(top) or cocaine (bottom). FR1, FR2, fixed-ratio one and two schedules.
b, Mean total lever presses (top) and infusions (bottom) during the acquisition
phase of saline (n = 42) or cocaine (n = 164) self-administration for all mice
used in the study. c, Schematic of experiment (top). Sample traces of AMPAR-
EPSCs at −70, 0 and +40 mV (coloured) and NMDAR-EPSCs at +40 mV
(black) (bottom) and plots (right) of A/N ratio as a function of rectification
index (small dots) for each D1R- (n = 9/41, saline/cocaine) or D2R-MSN
(n = 8/25). Large open and filled dots represent group mean saline and cocaine
data, respectively. Cocaine increased rectification index and A/N ratio in
D1R-MSNs but not in D2R-MSNs (rectification index in D1R- and
D2R-MSNs: effect of group (cocaine/saline), F(1,83) = 10.86, P < 0.01; cell
type × group, F(1,83) = 6.08, P < 0.05. Student’s t-test for cocaine versus saline in
D1R-MSNs, t(48) = 3.72, *P < 0.01; in D2R-MSNs, t(31) = 0.80, P = 0.43. A/N
ratio in D1R- and D2R-MSNs: effect of group, F(1,83) = 5.91, P < 0.05; cell
type × group, F(1,83) = 8.34, P < 0.01. Student’s t-test for cocaine versus saline in
D1R-MSNs, t(48) = 3.25, †P < 0.01; in D2R-MSNs, t(31) = 0.62, P = 0.54). Scale
bars, 20 pA, 20 ms. Plots, means with s.e.m.
Cocaine-evoked plasticity at identified inputs


To explore the possibility that distinct afferents to the NAc undergo 
contrasting forms of plasticity after cocaine self-administration, the opto- 
genetic effector channelrhodopsin tagged with enhanced yellow fluores- 
cent protein (ChR2-eYFP) was virally expressed by stereotaxic injection 
into the BLA, vHipp or mPFC of BAC transgenic reporter mice (Fig. 2a). 
Retrograde labelling with cholera toxin subunit B in the dorso- or ven- 
tromedial NAc shell confirmed these regions as providing major affer- 
ents (Extended Data Fig. 2a), whereas anterograde labelling of different 
inputs with distinct fluorescent markers identified the juxtaposition 
of fibres in the NAc (Extended Data Fig. 3), in agreement with studies 
reporting the convergence of inputs onto the same MSN**’”. Whole-cell 
recordings from mice infected with ChR2-eYFP in each region confirmed 
that inputs were excitatory (Extended Data Fig. 2c). Input and cell-type- 
specific plasticity were then evaluated ex vivo in the NAc shell from mice 
that self-administered cocaine or saline after 1 month of withdrawal. 
Figure 2 | Withdrawal from cocaine self-administration evokes input-
specific plasticity in NAc D1R-MSNs. a, Top, images of BLA, vHipp and
mPFC infected with ChR2-eYFP, with example traces and mean peak
amplitudes of photocurrents (100 ms light pulses). Scale bars, 50 ms, 50 pA.
Bottom, confocal image of NAc shell from a Drd1a-tdTomato mouse infected
with ChR2 in the mPFC, as used for input and cell-type-specific recordings. Cell
nuclei are stained with Hoechst. Scale bar, 100 µm. b, Top, schematic of whole-
cell recordings of D1R-MSNs from mice infected with ChR2 in the BLA (left),
vHipp (centre) or mPFC (right). Bottom, cocaine (coc) self-administration
increased the mean amplitude of 4 ms light-evoked EPSCs at vHipp and mPFC
synapses (t(62) = 3.12, *P < 0.01, n = 30/34 for saline (sal)/cocaine and
t(120) = 3.0, *P < 0.01, n = 66/56, respectively), but did not change BLA synapses
(t(18) = 0.88, P = 0.39; n = 11/9). c, For each input, example traces are shown
(top), with I/V plots (bottom) and group mean rectification index data (right).
Cocaine decreased normalized AMPAR-EPSCs at +40 mV from mPFC
(t(19) = 4.16, *P < 0.01, n = 8/13 for saline/cocaine) but not from BLA or vHipp
inputs (t(11) = 0.67, P = 0.52, n = 5/8 and t(20) = 0.18, P = 0.86, n = 9/13,
respectively). The rectification index was increased at mPFC synapses
(t(19) = 6.9, *P = 0.001), but not at BLA or vHipp synapses (t(11) = 0.58, P = 0.57
and t(20) = 0.22, P = 0.83, respectively). Scale bars, 20 ms, 20 pA. d, The A/N
ratio was decreased by cocaine self-administration at mPFC synapses
(t(19) = −3.59, P < 0.01), but increased at vHipp synapses (t(29) = 8.33, P < 0.001)
and was unchanged at BLA synapses (t(11) = 0.94, P = 0.36). *P < 0.05. Scale
bars, 20 ms, 20 pA. Error bars, s.e.m.
Recordings of light-evoked AMPAR-EPSCs first confirmed that the
strongest input onto the NAc arises from the hippocampus™ and that 
cocaine self-administration evoked a significant potentiation of AMPAR- 
EPSC amplitudes only at vHipp and mPFC inputs onto D1R-MSNs
(Fig. 2b). No change was observed at any input onto D2R-MSNs (Extended
Data Fig. 4a). Remarkably, rectification of the current-voltage (I/V)
curve was detected only at mPFC to D1R-MSN synapses in mice that
self-administered cocaine, but not at vHipp or BLA to NAc synapses
(Fig. 2c). Thus, cocaine self-administration triggered the insertion of 
CP-AMPARs only at mPFC to D1R-MSN synapses. Confirming find- 
ings with electrical stimulation, I/V curves were linear for all inputs 
onto D2R-MSNs in mice that self-administered cocaine (Extended Data 
Fig. 4b). For the A/N ratio, in mice that self-administered cocaine it was 
decreased at mPFC to D1R-MSN synapses (consistent with the poor
conductance of CP-AMPARs at +40 mV), but increased at vHipp to 
D1R-MSN synapses and unaffected at BLA synapses (Fig. 2d). Cocaine 
self-administration evoked no change in the A/N ratio at any input onto 
D2R-MSNs (Extended Data Fig. 4c). Note that in baseline conditions 
(that is, mice that self-administered saline) both the rectification index 
and AMPAR/NMDA ratios were similar across all inputs onto D1R- 
and D2R-MSNs, as predicted from electrical recordings. Collectively, 
these results demonstrate that cocaine self-administration and withdrawal
result in contrasting forms of plasticity at specific inputs onto
the same accumbal D1R-MSN. 


Homosynaptic effects of optogenetic protocols 


Having identified specific inputs onto NAc shell D1R-MSNs that express 
drug-evoked plasticity, we wanted to test for a causal link to cue-associated 
Figure 3 | Homosynaptic effects of optogenetic protocols applied ex vivo on
D1R-MSN plasticity. a, Schematic of experiment for b-e; ChR2 in vHipp.
b, Graph of normalized light-evoked EPSCs across time (bottom: each point
represents mean of six sweeps), with example traces (top: mean of 20 sweeps)
before (1) and after (2) the LTD protocol (4 ms pulses at 1 Hz, 10 min). One
month after cocaine self-administration, the efficiency of the 1 Hz LTD
protocol was not modified (56.3 ± 14.7% to 38.6 ± 12.2%, t(7) = 0.89, P = 0.4;
n = 5/4). c, As for b, but with a 13 Hz protocol (4 ms pulses at 13 Hz, 10 min).
The efficiency of this protocol was reduced in mice that self-administered
cocaine (39.2 ± 6.7% to 69.5 ± 10.1%, t(9) = 2.57, *P = 0.03; n = 6/5).
d, Example traces (left) of light-evoked AMPAR-EPSCs recorded at −70, 0 and
+40 mV, with I/V plot (middle) and group mean rectification index data
(right) before (Ctrl) and after the 1 or 13 Hz light protocol, both in saline (s) and
cocaine (c) self-administered groups. AMPAR composition was not modified
by cocaine or light protocols (saline/cocaine for Ctrl (n = 9/13), 1 Hz (6/4),
13 Hz (8/5)). e, For the same cells in d, example traces (left), with mean A/N
drug seeking. To this end, we aimed to develop optogenetic protocols
ex vivo that could restore basal synaptic transmission (Fig. 3a, f). However,
rather than simply erasing cocaine-evoked plasticity completely, we sought
to parse the relationship between plasticity at identified synapses and
cue-associated seeking behaviour.

Because synapses were essentially potentiated after cocaine self- 
administration, optogenetic protocols were used that could trigger long- 
term depression (LTD). When recording light-evoked AMPAR-EPSCs 
selectively at vHipp to D1R-MSNs, application of a 1 Hz NMDAR-
dependent LTD protocol” significantly depressed EPSCs both in saline 
and in cocaine self-administration groups (Fig. 3b). At the same input, a 
13 Hz extrasynaptic metabotropic glutamate receptor (mGluR)-dependent 
LTD protocol (Extended Data Fig. 5a’) triggered a depression in the 
cocaine self-administration group that was significantly attenuated
compared with controls (Fig. 3c). The consequence of applying either
protocol on cocaine-evoked plasticity at vHipp to NAc synapses (that is,
homosynaptic effects) was then examined. As described above, CP- 
AMPARs were not present at vHipp to NAc synapses after cocaine self- 
administration and neither the 1 nor 13 Hz vHipp protocol further altered 
the linear I/V curve or rectification index at this input (Fig. 3d). Remark- 
ably, however, the 1 Hz, but not the 13 Hz, vHipp protocol normalized 
the cocaine-evoked increase in the A/N ratio at this input (Fig. 3e). Note 
that in the group that self-administered saline, the 1 or 13 Hz vHipp 
protocols had little impact on the A/N ratio at these synapses, which 
was already low at baseline (Fig. 3e). 
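
To make the difference between the two light protocols concrete, the sketch below computes the pulse count and total illumination time for each, using the parameters given in the Figure 3 legend (4 ms pulses at 1 Hz or 13 Hz for 10 min). It is an illustrative calculation only: the function name and the returned fields are hypothetical, and evenly spaced pulses starting at time zero are an assumption.

```python
def pulse_train(freq_hz, duration_s=600.0, pulse_ms=4.0):
    """Describe a train of evenly spaced light pulses.

    Defaults follow the legend (4 ms pulses for 10 min); even spacing
    from t = 0 is an assumption made for illustration.
    """
    n_pulses = int(round(freq_hz * duration_s))
    onsets_s = [k / freq_hz for k in range(n_pulses)]
    light_on_s = n_pulses * pulse_ms / 1000.0
    return {
        "n_pulses": n_pulses,
        "onsets_s": onsets_s,
        "light_on_s": light_on_s,
        "duty_cycle": light_on_s / duration_s,
    }


for freq in (1.0, 13.0):
    train = pulse_train(freq)
    print(
        f"{freq:g} Hz: {train['n_pulses']} pulses, "
        f"{train['light_on_s']:.1f} s of light, "
        f"duty cycle {train['duty_cycle']:.3f}"
    )
```

Over the same 10 min, the 13 Hz protocol delivers thirteen times as many pulses as the 1 Hz protocol (7,800 versus 600), consistent with the idea that it releases more glutamate per unit time.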

These experiments were repeated, but now isolating mPFC to D1R- 
MSN synapses (Fig. 3f). The 1 Hz mPFC protocol only slightly depressed 
AMPAR-EPSCs in the cocaine self-administration group, whereas the 
ratios (right) before or after the 1 or 13 Hz protocol. Planned comparisons,
after analysis of variance (ANOVA), by t-test, †,*P ≤ 0.05. f, Schematic of
experiment for g-j; ChR2 in the mPFC. g, The 1 Hz protocol efficiency was
reduced in mice that self-administered cocaine (43 ± 5.5% to 74 ± 3.7%,
t(17) = 4.9, *P < 0.001; n = 8/11). h, The 13 Hz protocol efficiency was increased
in mice that self-administered cocaine (54 ± 6.1% to 29 ± 4.7%, t(22) = 3.25,
*P < 0.01; n = 13/11). i, The 13 Hz but not 1 Hz protocol normalized
cocaine-evoked changes in AMPAR-EPSCs at +40 mV and rectification index.
Planned comparisons, after ANOVA, by t-test, †,*P ≤ 0.05; (saline/cocaine for
Ctrl (n = 9/13), 1 Hz (10/7), 13 Hz (10/6)). j, For the same cells as i, the
cocaine-evoked decrease in A/N ratio was normalized by the 13 Hz, but not the
1 Hz, protocol. The 1 Hz protocol decreased A/N ratio in saline mice
(#P < 0.05). Planned comparisons, after ANOVA, by t-test, †,*P ≤ 0.05. Note
that control data are the same as shown in Fig. 2. Scale bars, 20 pA, 20 ms. Error
bars, s.e.m.
depression was significantly larger in controls (Fig. 3g). This finding
concurs with a previous study where NMDAR-dependent LTD was altered 
in ‘cocaine-addicted’ rats*®. The 13 Hz mPFC protocol was actually more 
efficient in the cocaine self-administration group (Fig. 3h), which is of 
particular interest because mGluR activation can remove CP-AMPARs 
at many synapses throughout the brain’’~*’. Both protocols triggered 
comparable LTD of AMPAR-EPSCs in D2R-MSNs from mice that
self-administered saline or cocaine (Extended Data Fig. 5b). The homosynaptic
effects of these protocols on cocaine-evoked plasticity at mPFC
to NAc synapses, which as described above involves the presence of
CP-AMPARs, were then examined. Consistent with LTD experiments,
the 13 Hz, but not the 1 Hz, mPFC protocol normalized both the rec-
tifying I/V curve and the depressed A/N ratio (Fig. 3i, j), pointing to
removal of CP-AMPARs from mPFC to D1R-MSN synapses. In the
group that self-administered saline, the 1 Hz, but not the 13 Hz, mPFC 
protocol significantly reduced the A/N ratio, whereas the rectification 
index remained unchanged in either case. 


Heterosynaptic effects of optogenetic protocols 


After evaluating the 1 and 13 Hz protocols on light-evoked homosy- 
naptic transmission, the possibility of heterosynaptic effects was explored 
ex vivo (that is, normalization of cocaine-evoked plasticity at an input 
other than that to which the protocol was applied). Indeed, in vivo recordings
have demonstrated that activation of one excitatory NAc input can 
trigger heterosynaptic plasticity”. 

Optogenetic LTD protocols were applied onto slices at vHipp or 
mPFC to NAc inputs but EPSCs were evoked with electrical stimu- 
lation, thus recruiting transmission from multiple afferents (Fig. 4a). The 
cocaine-evoked rectification of AMPAR transmission was restored by 
the 13 Hz protocol, whether applied at vHipp or mPFC to NAc synapses
(Fig. 4b). Because rectification was only found at mPFC to D1R-MSN
synapses, the 13 Hz vHipp protocol must have triggered normalization 
of this synapse heterosynaptically, most probably by activating mGluRs 
Figure 4 | Heterosynaptic effects of optogenetic protocols applied ex vivo on
D1R-MSN plasticity. a, Schematic of experiment. b, Example traces (left) of
electrically evoked AMPAR-EPSCs recorded at −70, 0 and +40 mV, with I/V
plots (middle) and grouped mean rectification index data (right) before (Ctrl)
or after light protocols (1 or 13 Hz) were applied with ChR2 in the vHipp or
mPFC. The 13 Hz but not 1 Hz light protocol applied at vHipp or mPFC
synapses normalized AMPAR-EPSCs at +40 mV and rectification index
(planned comparisons, after ANOVAs, with t-tests: *,†P ≤ 0.05; (saline/
cocaine for Ctrl (n = 9/41), for vHipp 1 Hz (6/8) and 13 Hz (8/8), for mPFC
1 Hz (12/9) and 13 Hz (15/8))). c, For the same cells shown in b, example traces
(left) and group mean A/N ratios (right) before or after the 1 or 13 Hz protocols.
The 1 Hz vHipp or the 13 Hz mPFC protocol normalized the cocaine-evoked
increase in A/N ratio, whereas the A/N ratio remained elevated after the
13 Hz vHipp protocol or the 1 Hz mPFC protocol (planned comparisons, after
ANOVAs, with t-tests: †,*P < 0.05 and #P = 0.07). Note that control data are
the same as shown in Fig. 1. Scale bars, 20 pA, 20 ms. Error bars, s.e.m.
at neighbouring synapses through glutamate spillover or intracellular
signalling. For the A/N ratio, which was increased in the cocaine self- 
administration group when recorded with electrical stimulation, nor- 
malization was observed using the 1 Hz vHipp protocol (Fig. 4c). This 
probably reflected a homosynaptic effect as reported above (see Fig. 3e). 
However, the 13 Hz mPFC protocol, in addition to removing rectifying 
AMPARs, also restored the A/N ratio; a heterosynaptic effect probably 
occurring through glutamate spillover activating NMDARs at vHipp 
to D1R-MSN synapses. Note that the 13 Hz vHipp protocol, in contrast
to its heterosynaptic effect on the rectification index, did not elicit a
homosynaptic reduction in the A/N ratio, possibly because NMDAR
activation was too strong to elicit depression****. Because our initial 
validations were performed ex vivo, we also confirmed that optogenetic 
protocols were efficient in normalizing cocaine-evoked plasticity when 
applied in vivo (Extended Data Figs 6 and 7, which include extra
discussion). Taken together, this approach provided an advantageous
situation in which to test the causality of contrasting forms of drug-evoked
plasticity in cue-associated cocaine seeking, because plasticity at mPFC or
vHipp to D1R-MSNs could be reversed separately using the 13 Hz
vHipp or the 1 Hz vHipp protocol, respectively, or together by using the
13 Hz mPFC protocol.


Causal link to components of cocaine seeking 


We then applied optogenetic protocols in self-administration mice to
test directly the causality between drug-evoked plasticity at identified 
inputs and a drug-adaptive behaviour relevant to addiction. After 1 month 
of withdrawal from self-administration, mice underwent a cue-associated 
seeking test, a rodent model of relapse, with optogenetic protocols applied 
once, 4 h before the test (Fig. 5a, b).

The 1 Hz vHipp protocol (that is, reversing cocaine-evoked plasticity 
only at vHipp to D1R-MSNs) significantly reduced cue-associated lever
pressing, a measure of response vigour, although cocaine seeking per se 
was still present. The 13 Hz vHipp protocol (that is, inducing a hetero- 
synaptic reversal of plasticity at mPFC to D1R-MSNs) led to increased 
inactive lever responding, pointing to impaired action-outcome response 
Figure 5 | Linking cocaine-evoked plasticity at identified inputs to specific
components of cue-associated cocaine seeking. a, Raster plots showing active
and inactive lever presses during a cue-associated seeking session. b, Group
mean data showing active (a) and inactive (i) lever presses. Seeking was robust
in control mice that self-administered cocaine (active versus inactive lever,
t(43) = 9.5, *P < 0.001; n = 11/44, saline/cocaine; active lever to saline control
group, after ANOVA, by t-test: t(33) = 4.6, #P < 0.001). Seeking was present after
the 1 Hz vHipp protocol (t(10) = 3.6, *P < 0.01), but active lever responses
were significantly diminished (versus cocaine control group, after ANOVA, by
t-test: t(53) = 2.3, †P < 0.05; n = 11/44). Lever discrimination was lost after the
13 Hz vHipp protocol (t(8) = 1.99, P > 0.05), and inactive lever responses
were significantly increased (versus cocaine control group, after ANOVA,
by t-test: t(51) = 4.48, P < 0.001; n = 9/44). Seeking was present after the 1 Hz
mPFC protocol and did not differ from the cocaine controls (versus cocaine
control groups by ANOVA: lever, protocol and lever × protocol interaction all
non-significant; n = 11/44). Active lever responding was reduced after the
13 Hz mPFC protocol (versus cocaine control group, after ANOVA, by t-test:
t(52) = 4.3, †P < 0.001; n = 10/44) and lever discrimination was lost (active
versus inactive levers by t-test, t(9) = 1.7, P > 0.1, n = 10). See Extended Data
Table 1 for complete statistics. Error bars, s.e.m.
discrimination. The 1 Hz mPFC protocol (that is, no effect on cocaine-
evoked plasticity) was without effect on seeking behaviour. Finally, the
13 Hz mPFC protocol (that is, normalizing both inputs) abolished seek-
ing behaviour (Fig. 5a, b). To examine the persistence of this effect, the
same mice were tested for cue-associated cocaine seeking 1 week later
without further optogenetic intervention. Seeking behaviour remained
absent in mice that received the 13 Hz mPFC protocol (Extended Data
Figs 8a, b and 9). Taken together, these data demonstrate that cocaine-
evoked plasticity at two inputs onto NAc D1R-MSNs is required for the
complete expression of cue-associated seeking, whereas each form of 
contrasting plasticity is causally related to a different component of the 
seeking behaviour, namely the vigour of the seeking response and the 
ability to discriminate cocaine- from non-cocaine-directed actions. 
We performed extra tests to control for the behavioural specificity 
of optogenetic interventions. First, optogenetic removal of cocaine- 
evoked plasticity did not preclude the acquisition of further reward- 
related instrumental learning (Extended Data Fig. 8c, which includes 
further description). Second, the 13 Hz mPFC protocol did not affect 
cue-associated food seeking behaviour in a cohort of mice that were 
previously trained to self-administer sucrose pellets (Extended Data 
Fig. 8d-f, which includes further description), suggesting that this protocol
is selective for cocaine-evoked plasticity necessary for cocaine
seeking, as opposed to plasticity that may correlate with food seeking36,37.


Discussion 


A striking feature of this study is the input-specific expression mech- 
anism of drug-evoked plasticity. In the same D1R-MSNs, synapses that 
belong to the mPFC input show rectifying AMPAR-EPSCs reflecting 
the insertion of GluA2-lacking CP-AMPARs, whereas at neighbouring 
vHipp synapses an increased A/N ratio and significantly larger EPSC 
amplitude indicate the insertion of more GluA2-containing AMPARs.
Moreover, such forms of plasticity were absent at other synapses and 
in D2R-MSNs, although these neurons may still contribute to features 
of addiction**. Why are some synapses susceptible to cocaine-evoked 
plasticity and why do the expression mechanisms differ between inputs? 
These findings may reflect a complex induction process, requiring specific 
neuronal activity patterns coinciding with high levels of mesolimbic dopa- 
mine during drug exposure. Indeed, different drug self-administration 
histories may favour the induction of synaptic plasticity at other NAc 
inputs, such as the BLA”. Another possibility is that contrasting expression 
mechanisms reflect differences in the quality of basal synaptic transmission 
between cortical and limbic inputs onto NAc MSNs”, or differences in 
the sensitivity of synapses to be influenced by the coincidence of con- 
verging glutamate and dopamine signals involving D1R signalling”. 

Here we focus on the vHipp and mPFC inputs onto D1R-MSNs, tak-
ing advantage of endogenous depression mechanisms (for example,
NMDAR or mGluR activation”*”*“°) to restore normal transmission 
in each input separately or both inputs together. Although the 1 Hz LTD 
protocol is sufficient to recruit NMDARs on the activated input, trains 
of action potentials (for example, 13 Hz stimulation) are required to 
release sufficient amounts of glutamate to activate perisynaptically 
located mGluRs, even at neighbouring inputs. mGluR-LTD in the NAc 
is expressed both pre- and postsynaptically?**”, but what most likely 
matters is that mGluR-LTD is an efficient mechanism to remove GluA2- 
lacking AMPARs, just as in other central nervous system synapses”!”*”?. 
It is therefore not surprising that the 1 Hz mPFC protocol failed to nor- 
malize rectification index and had no behavioural effect. The 1 Hz vHipp 
protocol normalized the A/N ratio at vHipp to D1R-MSN synapses
and significantly reduced the vigour with which the animal pressed the
active lever in search of cocaine. Thus, cocaine-evoked plasticity at vHipp
afferents to NAc D1R-MSNs may enhance recognition of the correct
context in which cocaine can be obtained. In the absence of this plas-
ticity, context recognition may be impaired and the certainty of actions
reduced. Meanwhile, removal of CP-AMPARs from mPFC to D1R-MSN
synapses led to a failure of discrimination between the cocaine-associated 
lever and a second lever on which responding had no consequence. This 
establishes cocaine-evoked plasticity at mPFC afferents to NAc D1R-
MSNs as necessary to allow correct action-outcome selection during
seeking. Finally, restoring transmission in both inputs abolished cue-
induced cocaine seeking. Thus, cocaine-evoked plasticity at multiple
inputs, with distinct expression mechanisms that change both the effi-
cacy and quality of transmission, is collectively necessary for the full
expression of cue-associated drug-seeking behaviour.

Previous studies have recognized the importance of NAc excitatory 
transmission in drug-adaptive behaviours using systemic or local appli- 
cations of pharmacological agents that are active during the behavioural 
test’. However, a key difference in the present study is that optogenetic
protocols were applied outside the test and remained effective 1 week 
later. Thus, findings here provide support for a model whereby a cocaine 
memory trace commandeers accumbal integration to control core com- 
ponents of relapse. Co-opting endogenous plasticity mechanisms emerges 
as a radically new way to modify behaviour and may serve as a blue- 
print for defining synaptic causalities in other synaptic disorders. 


METHODS SUMMARY 


Mice learned to self-administer cocaine or saline during once-daily 2 h sessions for 
10 consecutive days”’. In each self-administration session, mice could respond on 
either an active lever that resulted in a single infusion (0.75 mg kg⁻¹ cocaine or saline)
paired with a 5 s cue light and followed by a 20 s time-out period, or a second inactive 
lever that had no consequence. Responding was maintained under a fixed-ratio one 
schedule (one active lever press gives one infusion) during sessions 1-5, and fixed- 
ratio two schedule during sessions 6-10. After the final self-administration ses- 
sion, mice were retained in their home cage for 1 month of forced withdrawal. The 
60 min test of cue-associated cocaine seeking took place in the same box where self- 
administration training occurred. Responses on the active lever now triggered a 5 s
illumination of the cocaine-associated cue light, but without intravenous infusion, 
whereas inactive lever responses had no consequence. Optogenetic protocols were 
applied once in vivo, in the home cage, through bilaterally implanted optical fibres 
targeting the NAc shell 4 h before the seeking test, or before animals were killed for
ex vivo recordings. For whole-cell patch clamp recordings, the rectification index 
was calculated after measuring light-evoked EPSC amplitudes at different holding 
potentials (−70, 0 and +40 mV) in the presence of D-2-amino-5-phosphonovaleric
acid (AP5; 50 µM). The A/N ratio was calculated at +40 mV with the AMPAR com-
ponent pharmacologically isolated using AP5 (50 µM) and the NMDAR-EPSC com-
ponent determined by subtraction. When using ChR2-eYFP in BAC transgenic mice,
D2R-MSNs were identified as D2R-eGFP (enhanced green fluorescent protein)-
positive neurons in Drd2-eGFP mice or as tomato-negative neurons in Drd1a-
tdTomato mice, whereas D1R-MSNs were always identified as tomato-positive
cells. This approach was validated in a previous report”®. 
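
The reinforcement schedule described above can be expressed as a short sketch. It is a simplified illustration, not the software that controlled the operant boxes. The fixed-ratio values (one for sessions 1-5, two for sessions 6-10) and the 20 s time-out come from the text; the function names are hypothetical, and the handling of presses during the time-out (here ignored, neither reinforced nor counted towards the ratio) is an assumption.

```python
def fixed_ratio(session):
    """Active lever presses required per infusion in a given session."""
    if not 1 <= session <= 10:
        raise ValueError("acquisition comprised sessions 1 to 10")
    return 1 if session <= 5 else 2


def count_infusions(active_press_times_s, session, timeout_s=20.0):
    """Count infusions earned from a list of active-lever press times.

    Assumption: presses during the time-out are ignored entirely.
    """
    ratio = fixed_ratio(session)
    infusions = 0
    presses_towards_ratio = 0
    timeout_until_s = float("-inf")
    for t in sorted(active_press_times_s):
        if t < timeout_until_s:
            continue  # press falls inside the time-out period
        presses_towards_ratio += 1
        if presses_towards_ratio == ratio:
            infusions += 1
            presses_towards_ratio = 0
            timeout_until_s = t + timeout_s
    return infusions


# Hypothetical press times (s) within one session.
presses = [0.0, 5.0, 25.0, 30.0, 60.0]
print(count_infusions(presses, session=1))  # fixed-ratio one
print(count_infusions(presses, session=6))  # fixed-ratio two
```

For the same hypothetical press times, the fixed-ratio two schedule yields fewer infusions than fixed-ratio one, so animals must press more to maintain intake in sessions 6-10.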


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS


Animals. Mice were C57BL/6 or heterozygous BAC transgenic mice in which tomato 
expression was driven by D1R (Drd1a-tdTomato from Jackson Laboratories) gene
regulatory elements or eGFP driven by D2R (Drd2-eGFP from GENSAT) gene 
regulatory elements. Males and females were used. Transgenic mice had been back- 
crossed in the C57BL/6 line for a minimum of four generations. Mice were single 
housed after surgery. All animals were kept in a temperature- and humidity-controlled 
environment with a 12 h light/12 h dark cycle (lights on at 7:00). All procedures
were approved by the Institutional Animal Care and Use Committee of the Uni- 
versity of Geneva. 

Stereotaxic injections. AAV1-CAG-ChR2-eYFP (eYFP; also called venus) or AAV1- 
CAG-eYFP (for control mice) produced at the University of North Carolina (Vector 
Core Facility) was injected into the mPFC (focusing on the infralimbic area), vHipp
or BLA of 5- to 6-week-old mice. Anaesthesia was induced at 5% and maintained at 
2.5% isoflurane (w/v) (Baxter AG) during the surgery. The animal was placed in a 
stereotaxic frame (Angle One) and craniotomies were performed using stereotaxic 
coordinates adapted from a mouse brain atlas“ (for mPFC: anterior—posterior = +1.9; 
medial-lateral = + 0.3; dorsal-ventral (from the surface of the brain) = —2.5; for 
vHipp: anterior-posterior = —3.5; medial-lateral = + 2.8; dorsal-ventral = — 4.2; 
for BLA: anterior—posterior = —1.4; medial-lateral = + 3.0; dorsal-ventral = —3.8). 
Injections of virus (0.5 µl per injection site) used graduated pipettes (Drummond
Scientific Company), broken back to a tip diameter of 10-15 µm, at an infusion rate
of ~0.05 µl min⁻¹. Injections of ChR2 were made a minimum of 1 week before the
self-administration surgery, and thus optogenetic manipulations typically occurred
no sooner than 7 weeks after the injection. For anterograde tracing studies, 6- to
8-week-old C57BL/6J mice were infected with both AAV5-EF1a-mCherry in the
mPFC and AAV5-EF1a-eYFP in the vHipp to allow observation of multiple inputs
into NAc. 

For retrograde tract-tracing studies, mice were stereotaxically injected with the
retrograde tracer, cholera toxin subunit B conjugated to a fluorescent marker (cho-
lera toxin subunit B, Alexa Fluor 594 conjugate, Molecular Probes). Microinjection
needles were placed into the NAc medio-dorsal (anterior-posterior = +1.18; medial-
lateral = +0.5; dorsal-ventral = −3.7) or NAc medio-ventral (anterior-posterior =
+1.18; medial-lateral = +0.5; dorsal-ventral = −4.2) and 0.5 µl was injected over
5 min. The injector was left in place for an extra 5 min to allow for diffusion of
cholera toxin subunit B particles away from the injection site. Animals returned to
their home cages for 14 days before tissue preparation.
Tissue preparation and immunofluorescence. Mice were rapidly anaesthetized 
with pentobarbital (500 mg kg⁻¹, intraperitoneally, Sanofi-Aventis) and transcardially 
perfused with 4% (w/v) paraformaldehyde in 0.1 M sodium phosphate 
buffer (pH 7.5) (ref. 45). Brains were post-fixed overnight in the same solution 
and stored at 4°C. Thirty-micrometre-thick sections were cut with a vibratome 
(Leica) and stored at −20 °C in a solution containing 30% (v/v) ethylene glycol, 
30% (v/v) glycerol, and 0.1 M sodium phosphate buffer, until they were processed 
for immunofluorescence. Sections were processed as follows. Free-floating sections 
were rinsed in Tris-buffered saline (TBS: 0.25 M Tris and 0.5 M NaCl, pH 7.5), 
incubated for 5 min in TBS containing 3% H2O2 and 10% methanol (v/v), and then 
rinsed three times for 10 min in TBS. After incubation for 15 min in 0.2% (v/v) Triton 
X-100 in TBS, sections were rinsed three times in TBS again. Sections were then 
incubated for 1 h in a solution of 3% BSA in TBS. Finally, they were incubated for 72 h 
at 4°C with the primary antibodies mouse anti-DARPP-32 (1:1,000, a gift from 
P. Greengard, mouse monoclonal clone C54 (ref. 46)) and rabbit anti-RFP (1:1,000, 
Medical & Biological Laboratories, rabbit polyclonal, lot 042). After incubation 
with the primary antibodies, sections were rinsed three times for 10 min in TBS and 
incubated for 45 min with goat Cy3- and Cy5-coupled secondary antibodies (1:400, 
Jackson ImmunoResearch, lot 114787 and Life Technologies 1305923). Sections 
were rinsed for 10 min twice in TBS and twice in tris-buffer (0.25 M Tris) before 
mounting in 1,4-diazabicyclo-[2.2.2]-octane (DABCO, Sigma-Aldrich). Double- 
or triple-labelled images from each region of interest were obtained using sequential 
laser scanning confocal microscopy (Zeiss LSM510 META). Photomicrographs were 
obtained with the following band-pass and long-pass filter settings: GFP (band- 
pass filter: 505-530), Cy3 (band-pass filter: 560-615) and Cy5 (long-pass filter 
650). The objectives and the pinhole setting (1 airy unit) remained unchanged 
during the acquisition of a series for all images. The thickness of the optical section 
was ~1.6 µm with a ×20 objective and ~6 µm with a ×10 objective. GFP-labelled 
neurons were pseudo-coloured green and other immunoreactive neurons were 
pseudo-coloured red or blue. For images showing infected structure with AAV1- 
CAG-ChR2-eYFP, 30 µm-thick coronal sections were cut with a vibratome (Leica), 
stained with Hoechst (Sigma-Aldrich) and mounted with Mowiol (Sigma-Aldrich). 
Full images of brain slices from mPFC, BLA and vHipp were obtained with a 
MIRAX (Carl Zeiss) system equipped with a Plan-Apochromat ×20/0.8 objective, 
together with 4’,6-diamidino-2-phenylindole (DAPI) (emission: 455/50) and fluo- 
rescein isothiocyanate (FITC) (emission: 515-565) filters. 
Implantation of jugular vein catheter. The surgical procedure was adapted from 
refs 47, 48. Mice were anaesthetized with a mix of ketamine (60 mg kg⁻¹, Ketalar) 
and xylazine (12 mg kg⁻¹, Rompun) solution. Catheters (CamCaths, model MIVSA) 
made of silicone elastomer tubing (outside diameter 0.63 mm, inside diameter 
0.30 mm) were inserted 1.0-1.2 cm in the right jugular vein, about 5 mm from the 
pectoral muscle, to reach the right atrium. The other extremity of the catheter was 
placed subcutaneously in the mid-scapular region and connected to stainless steel 
tubing appearing outside the skin. Blood reflux in the tubing was checked to con- 
firm correct placement of the catheter. Mice were allowed to recover for 3-5 days 
before the start of drug self-administration training and received antibiotics (Baytril 
10%, 1 ml in 250 ml of water) in the drinking water during this period. Catheters 
were flushed daily with a heparin solution (Heparin ‘Bichsel’) in saline (30 IU) during 
the recovery period and just before and after each self-administration session. 
Self-administration apparatus. All behavioural experiments were performed dur- 
ing the light phase and took place in mouse operant chambers (ENV-307A-CT, Med 
Associates) situated in a sound-attenuating cubicle (Med Associates). Two retractable 
levers were present on both sides of one wall of the chamber and a food pellet 
dispenser was also present only for procedures involving food delivery. A cue-light 
was located above each lever and a house light was present in each chamber. Dur- 
ing intravenous drug self-administration sessions, the stainless steel tubing of the 
catheter device was connected through a CoEx PE/PVC tubing (BCOEX-T25, Instech 
Solomon) to a swivel (Instech Solomon) and then an infusion pump (PHM-100, Med 
Associates). The apparatus was controlled and data captured using a PC running 
MED-PC IV (Med Associates). 

Drug self-administration acquisition. Mice were deprived of food for 12 h before 
the first self-administration session to promote exploratory activity. No food shap- 
ing was undertaken and mice were given food access ad libitum after the first ses- 
sion. Each session was 120 min in duration and started with the illumination of the 
house light and the insertion of the two levers into the operant chamber. During 
the first five sessions, a single press on the active lever (termed fixed-ratio one, or 
FR1) resulted in an infusion of 0.75 mg kg⁻¹ of cocaine (cocaine hydrochloride, 
provided by the pharmacy of Geneva University Hospital, dissolved in 0.9% saline 
at 0.75 mg ml⁻¹ and delivered at 0.0177 ml s⁻¹ as a unit dose depending on the weight 
of the mouse) paired with a 5 s continuous illumination of the cue light above the 
active lever. Completion of the fixed-ratio also initiated a timeout period of 20 s 
during which cocaine was no longer available. For the next five sessions, two lever 
presses were needed to activate the infusion pump (FR2). The active lever (left or 
right lever) was randomly assigned for each mouse. To avoid an overdose of cocaine, 
a maximum of 45 infusions were allowed per session. Only mice having reached a 
stable rate of at least 70% of correct lever responses (number of active lever responses 
divided by total lever responses over the three last sessions of acquisition) were 
included in the study. Saline control mice undertook the same procedure as cocaine 
mice except that saline (NaCl 0.9%, B. Braun) replaced cocaine infusions. After 
acquisition, mice were randomly assigned to either behaviour or electrophysiology 
experiments (except wild-type mice, which were only used in behaviour experiments). 
Recordings were never performed after cue-associated seeking tests to avoid 
confounding effects on synaptic transmission. 
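
The dosing arithmetic and the inclusion criterion described above can be illustrated with a short Python sketch. The constants come from the text; the function names are hypothetical, and the sketch assumes that the 70% criterion is computed on responses pooled over the last three acquisition sessions, which the text does not state explicitly.

```python
# Illustrative sketch only; function names are hypothetical.
UNIT_DOSE_MG_PER_KG = 0.75       # cocaine delivered per infusion
CONCENTRATION_MG_PER_ML = 0.75   # cocaine dissolved in 0.9% saline
PUMP_RATE_ML_PER_S = 0.0177      # delivery rate of the infusion pump


def infusion_volume_ml(weight_g: float) -> float:
    """Volume of solution carrying one unit dose for a mouse of this weight."""
    dose_mg = UNIT_DOSE_MG_PER_KG * weight_g / 1000.0
    return dose_mg / CONCENTRATION_MG_PER_ML


def infusion_duration_s(weight_g: float) -> float:
    """Pump-on time needed to deliver one unit dose."""
    return infusion_volume_ml(weight_g) / PUMP_RATE_ML_PER_S


def meets_inclusion_criterion(active, inactive, threshold=0.70) -> bool:
    """Active presses divided by total presses, pooled over the given sessions.

    Assumption: pooling across sessions rather than averaging per-session ratios.
    """
    total = sum(active) + sum(inactive)
    if total == 0:
        return False
    return sum(active) / total >= threshold
```

Under these assumptions, a 25 g mouse receives 0.025 ml per infusion, which corresponds to a pump-on time of roughly 1.4 s.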

Optic fibre cannulation and in vivo optogenetic stimulation protocols. After 
completing the acquisition of self-administration, all mice were put in forced absti- 
nence for 30 days in their home cage. At days 15-20, mice destined for seeking tests 
or ex vivo validation were bilaterally implanted with a chronically indwelling optic-fibre 
cannula (made in house as described in ref. 50) into the NAc (anterior-posterior = 
+1.5; medial-lateral = +1.6; dorsal-ventral = −3.9 with a 10° angle) using stereotaxic 
apparatus as described above. Two screws were fixed into the skull to support 
the implant, which was further secured with dental cement. 

DPSS blue light lasers (MBL-473/50 mW; CNI Lasers) connected to the indwelling 
fibre optic by customized patch cords (BFL37-200 Custom; Thor Labs) and a 
double rotary joint (FRJ-1X2i; Doric Lenses) allowed mice to move freely during 
stimulation. The laser was triggered to deliver 4 ms pulses at 1 Hz or 13 Hz for 
10 min, and the protocol was applied in the home cage 4 h before the first drug-seeking 
session or killing for ex vivo electrophysiology recordings. Mice from the 
control group were, in a randomized manner, either not infected with ChR2 or 
infected with eYFP only and received one of the optogenetic protocols, or were 
infected with ChR2 but were not exposed to light-stimulation. Seeking behaviour 
did not differ among these three control conditions, so the data were collapsed. 
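
As a rough illustration of what the two stimulation protocols deliver, the sketch below computes the number of pulses, total light-on time and duty cycle from the stated parameters (4 ms pulses at 1 or 13 Hz for 10 min). The function names are hypothetical, and evenly spaced pulses are assumed.

```python
# Illustrative sketch only; function names are hypothetical.
def pulse_onsets_ms(frequency_hz: int, duration_s: int = 600) -> list:
    """Onset time (ms) of each pulse, assuming evenly spaced pulses."""
    n_pulses = frequency_hz * duration_s
    return [i * 1000.0 / frequency_hz for i in range(n_pulses)]


def protocol_summary(frequency_hz: int, pulse_ms: float = 4.0,
                     duration_s: int = 600) -> dict:
    """Pulse count, cumulative light-on time and duty cycle of one protocol."""
    n_pulses = frequency_hz * duration_s
    light_on_s = n_pulses * pulse_ms / 1000.0
    return {
        "n_pulses": n_pulses,
        "light_on_s": light_on_s,
        "duty_cycle": light_on_s / duration_s,
    }
```

With these parameters the 1 Hz protocol amounts to 600 pulses (2.4 s of light in total) and the 13 Hz protocol to 7,800 pulses (31.2 s of light in total).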
Test of cue-associated drug seeking. Thirty days after the final self-administration 
session (that is, day 40), mice were assigned to optogenetic protocol groups (control, 
1 or 13 Hz) according to performance during self-administration acquisition, such 
that acquisition did not differ between the groups. Control animals were included for 
each batch of tested animals, explaining the greater number of mice in the control 
group. The cue-associated drug-seeking test was a 60 min session, identical to the 
cocaine acquisition period (house light on, insertion of the two levers), except that 
one press on the active lever (FR1 schedule) now triggered illumination of the cue 
light for 5 s but without a cocaine infusion or a timeout period. The infusion pump 
was also activated during the drug-seeking session, because the pump noise provided 
an extra drug-associated cue. One week later (that is, day 48), mice undertook 
a second drug-seeking session (with no further optogenetic stimulation 4 h 
before) to assess the persistence of the effect of the optogenetic protocol on seeking behaviour. 
At the end of any behavioural experiment, mice were euthanized and brains fixed 
in paraformaldehyde to prepare histological slices for verification of ChR2 expres- 
sion and cannula placement (see Extended Data Fig. 9). 

Food training after drug seeking. Immediately after the second drug-seeking test, 
subgroups of mice were food deprived for 12 h and then started nine consecutive 
sessions of operant training for food to examine whether optogenetic protocols had 
long-term effects on the acquisition of new learning for a natural reward. The test 
was performed in similar chambers as used previously (ENV-307A-CT, Med Asso- 
ciates), except that two nose-poke holes were present instead of levers at each side 
of a food pellet dispenser. One nose-poke in the active hole (randomly assigned) 
illuminated the cue light for 5 s and triggered the delivery of one food pellet (50% 
sucrose, AIN-76A Rodent Tablet 20 mg, Testdiet). The response requirement was 
set at FR1 for sessions 1-3, FR2 for sessions 4-6 and FR3 for sessions 7-9. The house 
light was turned off during the entire session. Each session ended after 30 min or 
when 45 pellets had been delivered, whichever occurred first. Each mouse received 
2.7-3 g per day of standard laboratory chow during the food training procedure to 
maintain a stable body weight without any further weight gain. 
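
The food-training schedule can be summarized in a few lines of Python. This is a minimal sketch with hypothetical function names; it assumes that only active nose-pokes are passed in and that no timeout follows pellet delivery, since none is described for this procedure.

```python
# Illustrative sketch only; function names are hypothetical.
def response_requirement(session: int) -> int:
    """Fixed ratio in force: FR1 for sessions 1-3, FR2 for 4-6, FR3 for 7-9."""
    if not 1 <= session <= 9:
        raise ValueError("food training comprised nine sessions")
    return (session - 1) // 3 + 1


def pellets_earned(active_poke_times_s, ratio: int,
                   session_s: int = 30 * 60, max_pellets: int = 45) -> int:
    """Pellets delivered before the session ends (30 min or 45 pellets)."""
    pellets = 0
    pokes_since_pellet = 0
    for t in sorted(active_poke_times_s):
        if t >= session_s or pellets >= max_pellets:
            break  # session over: time limit or pellet limit reached
        pokes_since_pellet += 1
        if pokes_since_pellet == ratio:
            pellets += 1
            pokes_since_pellet = 0
    return pellets
```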

Effect of optogenetic protocols on cue-associated food seeking. To assess the 
specificity of the optogenetic treatment for drug versus food cue-associated seek- 
ing, we also tested a new cohort of mice for cue-associated food seeking after the 
optogenetic protocol. Mice underwent food training in the same conditions as for 
the cocaine self-administration group except that cocaine infusions were replaced 
by sucrose pellet delivery. Sessions ended after 30 min or when 45 pellets were deli- 
vered, whichever occurred first. After implantation of fibre optic cannula at day 
15-20 of forced abstinence from sucrose pellet consumption, mice were assigned to 
the control or 13 Hz groups (such that acquisition did not differ between the groups) 
and received the optogenetic treatment 4 h before a test for food-reward seeking that 
was undertaken in the same way as the test for cocaine seeking. The sound of the 
pellet dispenser was also paired with the food cue light during the session. Food seek- 
ing sessions lasted 30 min. Mice received 2.7-3 g per day of standard laboratory 
chow over the entire procedure to maintain a stable body weight (no weight gain). 
Slice electrophysiology. Coronal 200-250 µm slices of mouse brain were prepared 
in cooled artificial cerebrospinal fluid containing (in mM): NaCl 119, KCl 2.5, MgCl2 
1.3, CaCl2 2.5, Na2HPO4 1.0, NaHCO3 26.2 and glucose 11, bubbled with 95% O2 
and 5% CO2. Slices were kept at 32-34 °C in a recording chamber superfused with 
2.5 ml min⁻¹ artificial cerebrospinal fluid. Visualized whole-cell voltage-clamp recording 
techniques were used to measure holding and synaptic responses of MSNs of 
the NAc shell, identified by the presence of the eGFP or td-Tomato of BAC trans- 
genic mice by using a fluorescent microscope (Olympus BX50WI, fluorescent light 
U-REL-T). Note that when using ChR2-eYFP in BAC transgenic mice, D2R-MSNs 
were identified as D2R-eGFP-positive neurons in Drd2-eGFP mice, which could 
be visualized against fluorescence from ChR2-eYFP, or as tomato-negative neurons 
in Drd1a-tdTomato mice. D1R-MSNs were always identified as tomato-positive 
cells. This approach was validated in a previous report (ref. 28). The holding potential 
was −70 mV, and the access resistance was monitored by a hyperpolarizing step 
of −14 mV with each sweep, every 10 s. The liquid junction potential was small 
(−3 mV), and therefore traces were not corrected. Experiments were discarded if 
the access resistance varied by more than 20%. Currents were amplified (Multiclamp 
700B, Axon Instruments), filtered at 5 kHz and digitized at 20 kHz (National 
Instruments Board PCI-MIO-16E4, Igor, WaveMetrics). All experiments were performed 
in the presence of picrotoxin (100 µM). 


For recordings of light- and electrically evoked EPSCs, the internal solution 
contained (in mM) 130 CsCl, 4 NaCl, 5 creatine phosphate, 2 MgCl2, 2 Na2ATP, 
0.6 Na3GTP, 1.1 EGTA, 5 HEPES and 0.1 mM spermine. In some cases, QX-314 
(5 mM) was added to the solution to prevent action currents. Synaptic currents 
were electrically evoked by stimuli (50-100 µs) at 0.1 Hz through a bipolar stainless 
steel electrode placed onto the tissue. For optogenetic experiments, light-EPSCs 
were evoked with 4 ms blue light pulses from an optic fibre directed at the NAc 
shell that was coupled to a DPSS blue light laser. Low-frequency stimulation (1 or 
13 Hz for 10 min) was applied with 4 ms light pulses and the magnitude of LTD 
was determined by comparing average EPSCs that were recorded 20-30 min after 
induction to EPSCs recorded immediately before induction. To isolate AMPAR-evoked 
EPSCs, the NMDAR antagonist D-AP5 (50 µM) was bath applied. The 
NMDAR component was calculated as the difference between the EPSCs mea- 
sured in the absence and in the presence of D-AP5. The AMPAR/NMDAR ratio 
was calculated by dividing the peak amplitudes. The rectification index of AMPAR 
was calculated as the ratio of the chord conductance calculated at negative potential 
divided by chord conductance at positive potential. Note that, in example traces, 
stimulation artefacts were removed. 
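
The three derived measures described above (rectification index, AMPAR/NMDAR ratio and LTD magnitude) can be sketched as follows. The function names are hypothetical. The sketch assumes a reversal potential of 0 mV and test potentials of −70 and +40 mV for the chord conductances, takes "peak" as the largest absolute value of a trace, and expresses LTD as the post-induction mean in per cent of baseline; none of these choices is spelled out in the text.

```python
# Illustrative sketch only; names and default values are assumptions.
from statistics import mean


def chord_conductance(current_pa: float, potential_mv: float,
                      reversal_mv: float = 0.0) -> float:
    """Chord conductance (nS) = I / (V - E_rev); E_rev = 0 mV is an assumption."""
    return current_pa / (potential_mv - reversal_mv)


def rectification_index(i_neg_pa: float, i_pos_pa: float,
                        v_neg_mv: float = -70.0, v_pos_mv: float = 40.0,
                        reversal_mv: float = 0.0) -> float:
    """Chord conductance at negative potential divided by that at positive potential."""
    g_neg = chord_conductance(i_neg_pa, v_neg_mv, reversal_mv)
    g_pos = chord_conductance(i_pos_pa, v_pos_mv, reversal_mv)
    return g_neg / g_pos


def peak(trace) -> float:
    """Largest absolute deflection of a current trace."""
    return max(abs(x) for x in trace)


def ampar_nmdar_ratio(mixed_trace, ampar_trace) -> float:
    """AMPAR trace is recorded in D-AP5; NMDAR trace is mixed minus AMPAR."""
    nmdar_trace = [m - a for m, a in zip(mixed_trace, ampar_trace)]
    return peak(ampar_trace) / peak(nmdar_trace)


def ltd_percent_of_baseline(baseline_epscs, post_epscs) -> float:
    """Mean EPSC 20-30 min after induction, in per cent of the pre-induction mean."""
    return 100.0 * mean(post_epscs) / mean(baseline_epscs)
```

With these definitions a linear I/V relationship gives a rectification index of 1, and inward rectification gives values above 1.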

For recordings of mEPSCs the internal solution contained (in mM): 140 K Glu, 
2 MgCl2, 5 KCl, 0.2 EGTA, 10 HEPES, 4 Na2ATP, 0.3 Na3GTP and 10 creatine 
phosphate. Miniature EPSCs were recorded in the presence of tetrodotoxin (0.5 µM). 
The frequencies, amplitudes and kinetic properties of these currents were then 
analysed using the Mini Analysis software package (version 4.3, Synaptosoft). Note 
that recordings were performed with the electrophysiologist blinded to the self- 
administration condition (cocaine or saline), although for practical reasons the exper- 
imenter was aware of other conditions (for example, ChR2 infected or not, etc.). 
Statistics. No statistics were used to determine group sample size; however, sam- 
ple sizes were similar to those used in previous publications from our group and 
others reporting self-administration in mice. Multiple comparisons were first sub- 
ject to mixed-factor ANOVA defining both between- (for example, D1R- or D2R- 
MSN cells; saline or cocaine self-administration groups; control, 1 Hz or 13 Hz 
protocols; BLA, vHipp, mPFC inputs, etc.) and/or within- (for example, active or 
inactive levers) group factors. Where significant main effects or interaction terms 
were found (P < 0.05; or P < 0.1 indicative of a trend), further comparisons were 
made by a two-tailed Student’s t-test with Bonferroni corrections applied when 
appropriate (that is, the level of significance equalled 0.05 divided by the number 
of comparisons). Single comparisons of between- or within-group measures were 
made by two-tailed non-paired or paired Student’s t-test, respectively. ANOVAs 
for main figures are provided (see Extended Data Table 1a, b). 
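
The Bonferroni rule stated above (significance level equal to 0.05 divided by the number of comparisons) can be written out as a minimal Python sketch. The function names are hypothetical and the P values in the usage note are made-up illustrations, not data from this study.

```python
# Illustrative sketch only; function names are hypothetical.
def bonferroni_threshold(alpha: float, n_comparisons: int) -> float:
    """Per-comparison significance level: alpha divided by the number of comparisons."""
    if n_comparisons < 1:
        raise ValueError("at least one comparison is required")
    return alpha / n_comparisons


def significant_after_bonferroni(p_values, alpha: float = 0.05) -> list:
    """Flag each P value that falls below the corrected threshold."""
    threshold = bonferroni_threshold(alpha, len(p_values))
    return [p < threshold for p in p_values]
```

For three comparisons the corrected threshold is 0.05/3, about 0.0167, so hypothetical P values of 0.01, 0.02 and 0.04 would yield one significant comparison.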
Extended Data Figure 1 | Graphical abstract. Top left: main excitatory 
afferents onto NAc shell D1R-MSNs (BLA: basolateral amygdala, vHipp: 
ventral subiculum of the hippocampus and mPFC: medial prefrontal cortex), 
which at baseline contain synapses that express NMDARs and GluA2-containing 
AMPARs. Top right: 1 month after withdrawal (WD) from cocaine 
self-administration (SA), mPFC synapses onto NAc shell D1R-MSNs express 
GluA2-lacking AMPARs whereas more GluA2-containing AMPARs are 
added at vHipp synapses. Effects of NMDAR- or mGluR1-dependent 
(1 or 13 Hz, respectively) light protocols applied at specific inputs (shown 
in green) on cocaine-evoked plasticity are illustrated, together with the 
consequence for cue-associated seeking behaviour. 
Extended Data Figure 2 | Identification and optogenetic targeting of 
excitatory inputs to the NAc shell. a, Retrograde labelling with cholera toxin 
subunit B (AF594) injected into the NAc shell. Confocal images of injection 
sites (top) in the medio-dorsal (left) and medio-ventral (right) NAc shell are 
shown, regions where electrophysiology recordings were performed. 

b, Labelled cell bodies in corresponding projection areas (basolateral amygdala, 
ventral subiculum of the hippocampus and medial prefrontal cortex) are 
shown, with no discernable segregation between the medio-dorsal or medio- 
ventral NAc shell. For each projection area, the insert shows a complete 
hemisphere coronal section together with a zoomed image of the region of 
interest (indicated by yellow box). IL, infralimbic; CeL, central amygdala lateral; 
BLP, basolateral amygdala posterior; BMP, basomedial amygdala posterior; 
PV, paraventricular thalamic nucleus; vHipp, ventral subiculum of the 
hippocampus; VIEnt, ventral intermediate entorhinal cortex; VTA, ventral 
tegmental area. c, Schematic of experiment (top) with light-evoked EPSCs 
recorded in D1R-MSNs of the NAc shell of mice infected with AAV1-ChR2-eYFP 
in the BLA (bottom left), vHipp (middle) or vmPFC (right) before and 
after bath application of glutamate receptor antagonists (NBQX 10 µM and 
AP5 50 µM for AMPAR and NMDAR, respectively). Scale bars, 20 ms, 50 pA. 
Extended Data Figure 3 | Individual MSNs receive inputs from multiple 
projection areas. a, Confocal images of NAc from a mouse infected with 
AAV5-EF1-eYFP and AAV5-EF1-mCherry in the vHipp (left) and mPFC 
(right), respectively, at low magnification (first row). At higher magnification 
(second and third rows) eYFP from vHipp and mCherry from mPFC are 
present around MSNs stained by DARPP-32 (blue). aca, anterior commissure. 
Scale bar, 50 µm. 
Extended Data Figure 4 | Cocaine self-administration does not evoke 
input-specific plasticity in D2R-MSNs. a, Top, schematic of whole-cell 
recordings of NAc shell D2R-MSNs of mice that 1 month previously 
self-administered saline (open points) or cocaine (filled points) and were 
infected with AAV1-ChR2-eYFP in the BLA (left), vHipp (middle) or mPFC 
(right). Bottom, after cocaine self-administration the mean amplitude of 
light-evoked EPSCs was not changed at any input onto D2R-MSNs (effect of 
group (saline/cocaine) and group × input (BLA/vHipp/mPFC) all not 
significant) (n = 10/14 for BLA (saline/cocaine), n = 10/20 for vHipp and 

n = 60/51 for mPFC). b, For each input, the rectification index (RI) was 
calculated. Example traces are shown (top), with the I/V plot (middle) and 
group mean rectification index data (bottom). Cocaine did not modify 
normalized AMPAR-EPSCs at +40 mV from BLA, vHipp or mPFC inputs 
(t(12) = 0.20, P = 0.84, t(18) = 0.44, P = 0.67 and t(20) = 0.43, P = 0.67, 
respectively). The rectification index was also unchanged at D2R-MSN 
synapses from BLA, vHipp or mPFC inputs (t(12) = −0.32, P = 0.75, 
t(18) = −0.51, P = 0.62 and t(20) = −0.67, P = 0.51, respectively). Scale bars, 
20 ms, 20 pA. c, For the same cells as shown in b, the A/N ratio was calculated. 
For each input, example traces are shown (top), with group mean A/N ratios 
(bottom). Cocaine did not alter the A/N ratio at inputs onto D2R-MSNs from 
the BLA, vHipp or mPFC (t(12) = 0.19, P = 0.85, t(18) = 1.20, P = 0.25 and 
t(20) = −0.04, P = 0.97, respectively). Scale bars, 20 ms, 20 pA. Error bars, s.e.m. 
Extended Data Figure 5 | mPFC and NAc recordings during 1 and 13 Hz 
optogenetic protocols and LTD in NAc D2R-MSNs induced by mPFC 
protocols applied after saline and cocaine self-administration. a, Right, 
schematic of whole-cell recordings in the mPFC or NAc from mice infected 
with ChR2 in mPFC. Top, light-evoked action potentials recorded in current 
clamp of ChR2-infected mPFC neurons and EPSCs recorded in voltage-clamp 
of D1R-MSNs (bottom) during the beginning of the 1 Hz (left) or 13 Hz (right) 
stimulation protocols. Note that EPSCs fail to follow the 13 Hz protocol. 

b, Top left, schematic of experiment. Bottom, graph of normalized light-evoked 
EPSCs across time recorded in NAc D2R-MSNs from mice that self-administered 
saline or cocaine (each point represents mean of six sweeps), together 
with example traces (mean of 20 sweeps) before (1) and after (2) a 1 (left) or 
13 Hz (right) light protocol was applied ex vivo (4 ms pulses at 1 or 13 Hz, 

10 min). One month after saline or cocaine self-administration, the 1 Hz and 
13 Hz protocols induced comparable LTDs in both groups (for 1 Hz: 50 ± 4.9% 
to 40 ± 2.6%, Student’s t-test t(14) = 1.86, P = 0.080; n = 7-9 cells; for 13 Hz: 
28 ± 3.7% to 33 ± 4.2%, Student’s t-test t(18) = 1.41, P = 0.18; n = 11-9 cells). 
Error bars, s.e.m. 
Extended Data Figure 6 | Optogenetic protocols applied in vivo reverse 
cocaine-evoked plasticity at NAc D1R-MSNs. a, Top, schematic of 
experiment. Mice were infected with ChR2 in the vHipp or mPFC and trained 
in saline or cocaine self-administration. Mice were then implanted with fibre 
optics targeting the NAc shell, and optogenetic protocols were applied in vivo 
1 month after withdrawal. Four hours later, acute brain slices were prepared to 
assess the efficiency of optogenetic protocols applied in vivo to reverse cocaine-evoked 
plasticity at NAc D1R-MSNs. In brief, consistent with the ex vivo 
validation, the 1 Hz vHipp protocol applied in vivo normalized the A/N 
ratio, whereas the 13 Hz mPFC protocol normalized the I/V curve and 
rectification index in mice that self-administered cocaine. b, Left, schematic of 
experiment indicating that mice were infected with ChR2 in the vHipp. The 
rectification index and A/N ratio were determined with light-evoked EPSCs as 
described previously (see Fig. 2). The I/V plot is shown, together with group 
mean rectification index and A/N data. Cocaine did not alter normalized 
AMPAR-EPSCs recorded at +40 mV or the rectification index from vHipp 
inputs (same data as Fig. 3). The 1 Hz protocol applied in vivo was without 
effect on either of these measures in mice that self-administered saline or 
cocaine (effect of group (saline versus cocaine), protocol (control versus 1 Hz) 
and group × protocol, all not significant). The A/N ratio was increased at 
vHipp inputs in mice that self-administered cocaine (same data as Fig. 3), an 
effect that was reduced after the in vivo 1 Hz protocol (planned comparison, 
after ANOVA, by t-test, t(26) = 4.97, °P < 0.001) (n = 13/15 for 1 Hz saline/ 
cocaine group). c, As for b, except that mice were infected with ChR2 in the 
mPFC. Cocaine decreased AMPAR-EPSCs recorded at +40 mV and increased 
the rectification index from mPFC inputs (same data as Fig. 3). The 13 Hz 
protocol applied in vivo increased AMPAR-EPSCs at +40 mV in mice that 
self-administered cocaine (planned comparison with cocaine control group, 
after ANOVA, by t-test, too = 3.6, °P < 0.01) and decreased the rectification 
index in mice that self-administered cocaine (planned comparison with cocaine 
controls, after ANOVA, by t-test, tay = 5.2, °P < 0.001). The A/N ratio was 
decreased at mPFC inputs in mice that self-administered cocaine (same data 
as Fig. 3), an effect that was normalized after the in vivo 13 Hz protocol 
(planned comparison with cocaine controls, after ANOVA, by t-test, too = 2.8, 
°P = 0.01) (n = 11/9 for 13 Hz saline/cocaine group). Error bars, s.e.m. 
Extended Data Figure 7 | Assessing effects of in vivo light stimulation on 
mEPSCs recorded in D2R- and D1R-MSNs. a, Top, schematic of experiment. 
Mice were infected with ChR2 in the vHipp or mPFC and trained in saline or 
cocaine self-administration. Mice were then implanted with fibre optics 
targeting the NAc shell and optogenetic protocols applied in vivo 1 month after 
withdrawal. Four hours later, acute brain slices were prepared to assess the 
effect of optogenetic protocols applied in vivo on global excitatory transmission 
by recording mEPSCs at NAc D2R-MSNs. In brief, recordings from 
D2R-MSNs showed that mEPSCs were not affected by cocaine and not 
depressed by optogenetic LTD protocols applied in vivo. Note that although 
optogenetic protocols efficiently induced LTD at single inputs onto D2R- 
MSNs (Extended Data Fig. 5), this was not reflected by a decrease in mEPSC 
amplitudes. This may be accounted for by a presynaptic expression mechanism 
of LTD or that baseline amplitudes were already low such that a further 
depression only at a single input could not be measured by mEPSCs (that is, 
floor effect), which reflects synaptic transmission from multiple inputs. 

b, Example of mEPSCs recorded ex vivo in NAc shell D2R-MSNs in the 
presence of picrotoxin (100 µM) and tetrodotoxin (0.5 µM) (sample traces 
comprising six superimposed, 4 s traces). Scale bars, 20 pA, 500 ms. 

c, Histograms of group mean data of D2R-MSN mEPSC amplitudes (left) and 
frequency (right) are shown in control saline (sal) or cocaine (coc) conditions 
and after application of 1 Hz or 13 Hz light protocols at vHipp or mPFC 
synapses. Mean mEPSC amplitudes and frequencies were not changed by 
cocaine, and were not significantly decreased by protocols applied at either 
vHipp or mPFC inputs (n = 5 to 13 cells per group). Error bars, s.e.m. d, e, As for 
a and b except in D1R-MSNs. In brief, the frequency of mEPSCs was not 
affected by cocaine self-administration or laser protocols. In contrast, in mice 
that self-administered cocaine the amplitude of mEPSCs was significantly 
larger than controls, in line with a postsynaptic expression mechanism. 
Protocols that were most efficient at restoring the A/N ratio at vHipp synapses 
when assessed on slice, namely the 13 Hz mPFC and the 1 Hz vHipp protocol, 
were also most efficient at restoring baseline mEPSC amplitudes when applied 
in vivo. This suggests that cocaine-evoked plasticity at vHipp inputs largely 
accounts for the observed increase in mEPSC amplitudes. f, Mean mEPSC 
amplitudes were increased by cocaine self-administration (versus saline 
control, by t-test, t)g = 7.13, *P < 0.001). Protocols applied at vHipp terminals 
altered mEPSC amplitudes (one-way ANOVA comparing cocaine control, 1 
and 13 Hz vHipp protocols: effect of protocol F;3 = 5.7, P< 0.01). The 1 Hz 
but not the 13 Hz vHipp protocol reduced mEPSC amplitudes (versus cocaine 
control, for 1 Hz: tyg = 2.8, °P = 0.01), although amplitudes remained 
significantly higher than saline control mice after either protocol (all P< 0.01). 
Protocols applied at mPFC terminals also altered mEPSC amplitudes (effect of 
protocol, F(2,25) = 13.1, P < 0.001), and the 13 Hz but not 1 Hz mPFC protocol 
reduced mEPSC amplitudes (versus cocaine control, for 1 Hz: ty) = 4.7, 

°P < 0.001). Moreover, amplitudes after the 13 Hz mPFC protocol did not 
differ from saline controls (P = 0.14). The frequencies of mEPSCs were not 
altered by cocaine or by protocols applied at either vHipp or mPFC inputs 
(n = 8-11 cells per group). All error bars, s.e.m. 
Extended Data Figure 8 | Optogenetic effect on cocaine seeking is 
persistent, and does not alter learning of a new response for food reward or 
food cue-associated seeking. a, Schematic of experiment. Cocaine control and 
13 Hz mPFC group mice were exposed to a second cocaine cue-associated 
seeking test 1 week after the first test (see Fig. 5), but without further 
optogenetic intervention. Two days after this second test, subgroups of mice 
from each condition were trained to nose-poke for sucrose pellets during once- 
daily 30 min sessions over 9 days. b, Raster plots of cue-associated seeking 
behaviour (active and inactive lever presses) during the second test are shown 
(left), together with group mean data showing active (a: filled boxes) and 
inactive (i: open boxes) lever presses. Seeking was present in cocaine control 
mice during the second test, but was absent in the 13 Hz mPFC group (effect of 
group, group and lever × group, all P < 0.05; active versus inactive lever 
comparison by t-test, t(43) = 6.13, *P < 0.001 for controls, n = 44, and t(9) = 0.97, 
P = 0.35 for 13 Hz mPFC, n = 10; active versus active lever comparison, 
°P < 0.01). c, For the acquisition of a new response test, group mean values are 
shown of active and inactive nose-pokes across training days. Responding was 


maintained by FR1, FR2 and FR3 schedules. ANOVA comparisons of nose-poke 
responding across 3 days at each fixed ratio schedule confirmed no 
difference between cocaine self-administration control (n = 15) and 13 Hz 
mPFC (n = 4) mice (effects of group and day × group × nose poke 
interactions, all not significant). d, Schematic of experiment. One month after 
food self-administration training, mice received the 13 Hz mPFC light 
protocol, 4 h before a 30 min test of seeking. e, Graph shows mean of active (a) 
and inactive (i) lever presses for sucrose pellets by new cohort of mice (n = 14) 
infected or not with ChR2 in the mPFC. Training parameters were identical to 
those used for the acquisition of cocaine self-administration, except that 
sessions lasted only 30 min. f, Raster plots of active and inactive lever presses 
during a cue-associated sucrose seeking session (left), together with group 
mean data of total active (a) and inactive (i) lever presses during the 30 min 
seeking test. Food seeking was robust in control mice and did not differ in the 
13 Hz mPFC group (effect of lever, F(1,12) = 163.7, P < 0.002; group and 
lever × group all non-significant; n = 6/8). Error bars, s.e.m. 
Extended Data Figure 9 | Position of fibre optic cannula placements for 
mice used in tests of cue-seeking. Figure shows identified fibre optic cannula 
tip placements in controls (left) or mice that received different light protocols 
targeting either vHipp or mPFC to NAc synapses (right). Note that when mice 
were used for electrophysiology recordings, fibre optic placements were 
visually confirmed but not recorded. 
Extended Data Table 1 | ANOVA comparisons for (a) Figs 1-4 and (b) Fig. 5

a

Figure | Measure | ANOVA factors | Effect | P
1c | RI | Cell (D1R-MSN, D2R-MSN) × Group (saline, cocaine) | Group F(1,83)=10.86 | 0.001
 |  |  | Cell F(1,83)=3.18 | <0.1
 |  |  | Group × Cell F(1,83)=6.08 | <0.05
 | A/N | Cell (D1R-MSN, D2R-MSN) × Group (saline, cocaine) | Group F(1,83)=5.91 | <0.05
 |  |  | Cell F(1,83)=11.91 | 0.001
 |  |  | Group × Cell F(1,83)=8.34 | <0.01
2b | Amplitudes | Group (saline, cocaine) × Input (BLA, vHipp, mPFC) | Group F(1,206)=6.44 | <0.05
 |  |  | Input F(2,206)=18.01 | <0.001
 |  |  | Group × Input F(2,206)=4.14 | <0.05
2c | RI | Group (saline, cocaine) × Input (BLA, vHipp, mPFC) | Group F(1,56)=14.56 | <0.001
 |  |  | Input F(2,56)=9.87 | <0.001
 |  |  | Group × Input F(2,56)=18.03 | <0.001
2d | A/N | Group (saline, cocaine) × Input (BLA, vHipp, mPFC) | Group F(1,56)=1.64 | ns
 |  |  | Input F(2,56)=11.44 | <0.001
 |  |  | Group × Input F(2,56)=33.41 | <0.001
3d | Amplitude @ +40 mV (vHipp) | Group (saline, cocaine) × Protocol (Control, 1 Hz, 13 Hz) | Group F(1,45)=0.32 | ns
 |  |  | Protocol F(2,45)=0.37 | ns
 |  |  | Group × Protocol F(2,45)=0.23 | ns
 | RI (vHipp) | Group (saline, cocaine) × Protocol (Control, 1 Hz, 13 Hz) | Group F(1,45)=1.39 | ns
 |  |  | Protocol F(2,45)=0.64 | ns
 |  |  | Group × Protocol F(2,45)=0.78 | ns
3e | A/N (vHipp) | Group (saline, cocaine) × Protocol (Control, 1 Hz, 13 Hz) | Group F(1,46)=59.43 | <0.001
 |  |  | Protocol F(2,46)=9.76 | <0.001
 |  |  | Group × Protocol F(2,46)=5.73 | <0.01
3i | Amplitude @ +40 mV (mPFC) | Group (saline, cocaine) × Protocol (Control, 1 Hz, 13 Hz) | Group F(1,54)=15.72 | <0.001
 |  |  | Protocol F(2,54)=3.5 | <0.05
 |  |  | Group × Protocol F(2,54)=2.57 | <0.1
 | RI (mPFC) | Group (saline, cocaine) × Protocol (Control, 1 Hz, 13 Hz) | Group F(1,54)=33.23 | <0.001
 |  |  | Protocol F(2,54)=13.18 | <0.001
 |  |  | Group × Protocol F(2,54)=10.72 | <0.001
3j | A/N (mPFC) | Group (saline, cocaine) × Protocol (Control, 1 Hz, 13 Hz) | Group F(1,54)=3.08 | <0.1
 |  |  | Protocol F(2,54)=5.6 | <0.01
 |  |  | Group × Protocol F(2,54)=6.0 | <0.01
4b | Amplitude @ +40 mV (vHipp) | Group (saline, cocaine) × Protocol (Control, 1 Hz, 13 Hz) | Group F(1,80)=9.92 | <0.05
 |  |  | Protocol F(2,80)=3.37 | <0.05
 |  |  | Group × Protocol F(2,80) | 
 | RI (vHipp) | Group (saline, cocaine) × Protocol (Control, 1 Hz, 13 Hz) | Group F(1,80)=11.8 | 0.001
 |  |  | Protocol F(2,80)=3.04 | 0.05
 |  |  | Group × Protocol F(2,80)=3.39 | <0.05
4c | A/N (vHipp) | Group (saline, cocaine) × Protocol (Control, 1 Hz, 13 Hz) | Group F(1,80)=13.5 | <0.001
 |  |  | Protocol F(2,80)=5.6 | <0.01
 |  |  | Group × Protocol F(2,80)=2.07 | ns
4b | Amplitude @ +40 mV (mPFC) | Group (saline, cocaine) × Protocol (Control, 1 Hz, 13 Hz) | Group F(1,94)=12.1 | 0.001
 |  |  | Protocol F(2,94)=0.6 | ns
 |  |  | Group × Protocol F(2,94)=3.96 | <0.05
 | RI (mPFC) | Group (saline, cocaine) × Protocol (Control, 1 Hz, 13 Hz) | Group F(1,94)=16.95 | <0.001
 |  |  | Protocol F(2,94)=2.3 | 0.1
 |  |  | Group × Protocol F(2,94)=4.83 | 0.01
4c | A/N (mPFC) | Group (saline, cocaine) × Protocol (Control, 1 Hz, 13 Hz) | Group F(1,94)=9.12 | <0.01
 |  |  | Protocol F(2,94)=2.2 | ns
 |  |  | Group × Protocol F(2,94)=2.65 | <0.1

b

Figure | Measure | Comparisons (ANOVA or t-test) | Effect | P
5b | Cocaine control vs. saline control | Group (saline, cocaine) × Lever (active, inactive) | Lever F(1,53)=21.7 | <0.001
 |  |  | Group F(1,53)=16.8 | <0.001
 |  |  | Lever × Group F(1,53)=22.9 | <0.001
 | Saline control | active vs. inactive | t(10)=0.43 | ns
 | Cocaine control | active vs. inactive | t(43)=9.5 | <0.001
 | Cocaine control vs. saline control | active lever comparison | t(53)=4.6 | <0.001
 | Cocaine control vs. saline control | inactive lever comparison | t(53)=1.54 | ns
 | 1 Hz vHipp vs. cocaine control | Lever (active, inactive) × Protocol (Control, Laser) | Lever F(1,53)=45.7 | <0.001
 |  |  | Protocol F(1,53)=4.7 | <0.05
 |  |  | Lever × Protocol F(1,53)=5.13 | <0.05
 | 1 Hz vHipp | active vs. inactive | t(10)=3.6 | <0.01
 | 1 Hz vHipp vs. cocaine control | active lever comparison | t(63)=2.3 | <0.05
 | 1 Hz vHipp vs. cocaine control | inactive lever comparison | t(53)=1.1 | ns
 | 13 Hz vHipp vs. cocaine control | Lever (active, inactive) × Protocol (Control, Laser) | Lever F(1,51)=30.8 | <0.001
 |  |  | Protocol F(1,51)=4.12 | <0.05
 |  |  | Lever × Protocol F(1,51)=6.41 | <0.05
 | 13 Hz vHipp | active vs. inactive | t(8)=1.99 | ns
 | 13 Hz vHipp vs. cocaine control | active lever comparison | t(51)=0.39 | ns
 | 13 Hz vHipp vs. cocaine control | inactive lever comparison | t(51)=4.48 | <0.001
 | 1 Hz mPFC vs. cocaine control | Lever (active, inactive) × Protocol (Control, Laser) | Lever F(1,53)=52.08 | <0.001
 |  |  | Protocol F(1,53)=1.1 | ns
 |  |  | Lever × Protocol F(1,53)=3.08 | ns
 | 1 Hz mPFC | active vs. inactive | t(10)=4.16 | 0.002
 | 13 Hz mPFC vs. cocaine control | Lever (active, inactive) × Protocol (Control, Laser) | Lever F(1,52)=23.2 | <0.001
 |  |  | Protocol F(1,52)=16.7 | <0.001
 |  |  | Lever × Protocol F(1,52)=17.5 | <0.001
 | 13 Hz mPFC | active vs. inactive | t(9)=1.74 | ns
 | 13 Hz mPFC vs. cocaine control | active lever comparison | t(52)=4.29 | <0.001
 | 13 Hz mPFC vs. cocaine control | inactive lever comparison | t(52)=2.19 | <0.05
5d | 13 Hz mPFC vs. food control | Lever (active, inactive) × Protocol (Control, Laser) | Lever F(1,12)=163.7 | <0.001
 |  |  | Protocol F(1,12)=0.022 | ns
 |  |  | Lever × Protocol F(1,12)=0.41 | ns
Cell competition is a tumour suppressor
mechanism in the thymus 


Vera C. Martins, Katrin Busch, Dilafruz Juraeva, Carmen Blum, Carolin Ludwig, Volker Rasche, Felix Lasitschka,
Sergey E. Mastitsky, Benedikt Brors, Thomas Hielscher, Hans Joerg Fehling & Hans-Reimer Rodewald


Cell competition is an emerging principle underlying selection for cellular fitness during development and disease. Competition 
may be relevant for cancer, but an experimental link between defects in competition and tumorigenesis is elusive. In the 
thymus, T lymphocytes develop from precursors that are constantly replaced by bone-marrow-derived progenitors. Here we 
show that in mice this turnover is regulated by natural cell competition between ‘young’ bone-marrow-derived and ‘old’ 
thymus-resident progenitors that, although genetically identical, execute differential gene expression programs. Disruption of 
cell competition leads to progenitor self-renewal, upregulation of Hmga1, transformation, and T-cell acute lymphoblastic
leukaemia (T-ALL) resembling the human disease in pathology, genomic lesions, leukaemia-associated transcripts, and 
activating mutations in Notch1. Hence, cell competition is a tumour suppressor mechanism in the thymus. Failure to select 
fit progenitors through cell competition may explain leukaemia in X-linked severe combined immune deficiency patients who 
showed thymus-autonomous T-cell development after therapy with gene-corrected autologous progenitors. 


Cell competition, a process favouring fit over weak or harmful cells, 
was originally identified in Drosophila’ °. Competition has been studied 
in tissue mosaics in which the interaction between genetically distinct 
cells differing in gene dosage eliminated one population (‘losers’) due to 
the presence of another (‘winners’). In cancer, following initial muta- 
genesis and transformation, competition may be involved in tumour 
cell selection, and progression”’. 

Studies in haematopoietic stem and progenitor cells provided evid- 
ence for p53-mediated cell competition*® and for the importance of the 
‘fitness’ of normal cells in preventing leukaemia from oncogene-bearing 
cells’. Because competition could provide a powerful mechanism of 
tissue surveillance, ‘natural’ cell competition (between genetically iden- 
tical cells) might be a tumour-suppressor mechanism ensuring normal 
cell turnover, and preventing accumulation of mutations in ‘unopposed’
cells. Lack of competition may favour the formation rather than
mere selection of faulty cells, but examples for natural cell competition, 
or consequences of its disruption, are elusive. 

Thymus function normally depends on a continuous supply of T-cell
progenitors migrating from the bone marrow to the thymus’. However, 
when deprived of progenitors from the bone marrow, the thymus can main- 
tain autonomous T-cell development for several months*”. Hence, thy- 
mocyte turnover is regulated by bone marrow progenitor colonization. 
Acute lymphoblastic leukaemias (ALL) are the most frequent childhood 
malignancies. T-lineage ALL (T-ALL) arises by malignant transforma- 
tion of T-cell progenitors'®. Widespread genomic lesions in T-ALL re- 
flect genomic instability in T-cell progenitors"’, and often affect genes 
regulating T-cell development’’. Notably, human T-ALL frequently bear 
somatic gain-of-function mutations in NOTCH1'*™, Yet, clues on what 
goes astray when a normal thymus gives rise to leukaemia are lacking. 

We report here that thymocyte turnover is regulated by natural cell 
competition. When relieved from outside competition, intrathymic 
precursors persist, self-renew, and eventually leukaemia emerges that 
resembles human T-ALL. The system links impaired cell competition 
to tumour formation, and enables molecular analyses of the progression 
from normal T-cell development via pre-malignant stages to leukaemia. 


Progenitor deprivation causes T-ALL 


Thymus autonomy is induced by exposing a normal thymus to an 
environment deficient for T-cell progenitors*” (Fig. 1a). Consequences 
of sustained progenitor deprivation (Fig. 1a) were analysed in thymus 
grafts, endogenous thymi and host spleens (Fig. 1b). T-cell development 
persisted (CD4+CD8+ double-positive (DP) thymocytes present), or
was exhausted (recirculating CD4+ and CD8+ single-positive T cells
present). CD4lo/+CD8+ cells in the graft with large forward side scatter
(not shown) suggested ‘pre-T-ALL’. Dissemination of these cells throughout
lymphoid organs implied T-ALL development (Fig. 1b). The cells
were mostly but not exclusively (Extended Data Fig. 1a) CD4lo/+CD8+
and had blast morphology (Fig. 1c). Mice developed tumours at the site
of grafting (Extended Data Fig. 1b), splenomegaly (Fig. 1d), infiltrations
into bone marrow and solid organs as seen by anaemic bone
cavities (Fig. 1e), pale livers (Fig. 1f) and kidneys (not shown). Spleen
volumes and cellularity were increased about tenfold compared with
normal spleens (Extended Data Fig. 1c, d). Leukaemic cells proliferated
(Extended Data Fig. 1e), and were broadly localized in the spleen (Extended
Data Fig. 1f). The liver showed perivascular and periportal infiltrates
of CD8+ neoplastic cells (Extended Data Fig. 1g, h). Primary T-ALL
samples were transplantable into nude mice (Extended Data Figs 1i and 2).
Because T-cell antigen receptor (TCR) β loci had one (12/20 T-ALL samples)
or two (8/20 samples) rearrangements (Extended Data Fig. 3), T-ALL
originated from one or two clones at most. In 13/20 (65%) samples,
we found at least one productive TCR-β rearrangement, suggesting that
transformation can occur before or after pre-TCR selection”’.


Bone marrow progenitor fitness and T-ALL development 
in the thymus 

We characterized the impact of the relative fitness of bone-marrow- 
derived progenitors on leukaemia development. Whereas progenitors 
in the thymus were always wild type, the fitness of bone marrow progenitors
increased in the following order: Rag2−/−γc−/−KitW/Wv <
Rag2−/−γc−/− < γc−/− < KitW/Wv < Rag1−/−. These mutants are deficient
22 MAY 2014 | VOL 509 | NATURE | 465


©2014 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


[Figure 1 image. Panel labels: a, newborn thymus donor grafted into a T-cell-progenitor-deficient recipient (Rag2−/−γc−/−KitW/Wv); thymus-autonomous T-cell production, persistent or exhausted; long-term fate of the progenitor-deprived thymus? b, thymus (graft, endogenous) and spleen. c, B6/Ly5.1 and T-ALL.]

Figure 1 | Progenitor deprivation causes T-ALL. a, Experimental design.
b, Cell phenotypes, pre-gated for thymus donor origin, in thymus grafts,
endogenous thymi and spleens of recipient mice. Numbers show percentages of
cells in the quadrants. Data are representative for one out of two experiments
analysing a total of 11 mice 16 to 18 weeks after transplantation. c, Cytospins of
control (B6/Ly5.1) (left) and the T-ALL+ spleens (right) (scale bar, 20 μm).
d-f, Spleens (d), tibiae (e) and livers (f) from a control mouse (B6/Ly5.1)
(right), and a T-ALL+ recipient (left). Data shown (d-f) are representative for
all T-ALL+ mice. Scale bars, 0.5 cm in d, e; 1 cm in f.


in responding to essential thymus cytokines, Kit ligand and interleukin 
7 (IL-7)’°, and/or have a developmental block caused by recombination 
activating gene (Rag1 or Rag2) ablation. In the least competitive environment,
64% of all recipients developed T-ALL (Fig. 2a). Incidences
were 50% in Rag2−/−γc−/−, and 38% in γc−/− (mouse model of human
X-linked severe combined immune deficiency (SCID)) hosts. Because
the IL-7 receptor (IL-7r) is composed of IL-7rα plus γc chains, we tested
Il7rα−/− hosts, which were unable to suppress leukaemia (Fig. 2a). Hence,
the incidence of leukaemia was determined by the failure of bone-marrow-derived
progenitors to outcompete intrathymic progenitors
by IL-7 responsiveness. T-ALL did not develop in Rag1−/− or Kit-deficient
(KitW/Wv) mutants, in which the bone marrow fully replaces
intrathymic progenitors®, and none of the thymus donor strains were
‘prone’ for T-ALL (Fig. 2a) (Supplementary Discussion).
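
The incidence curves referred to above (Fig. 2a) are Kaplan-Meier estimates. As a minimal sketch of how such a curve is computed, the Python below implements the product-limit estimator. The follow-up times and outcomes are hypothetical illustration values, not data from this study, and the function name `kaplan_meier` is an assumption.

```python
from collections import Counter


def kaplan_meier(times, events):
    """Product-limit estimate of the disease-free fraction S(t).

    times  -- follow-up time per mouse (weeks)
    events -- True if T-ALL was observed at that time, False if censored
    Returns a list of (time, S(time)) at each time with at least one event.
    """
    events_at = Counter(t for t, e in zip(times, events) if e)
    leaving_at = Counter(times)          # events plus censored animals
    at_risk = len(times)
    surviving = 1.0
    curve = []
    for t in sorted(leaving_at):
        d = events_at.get(t, 0)
        if d:
            surviving *= 1 - d / at_risk
            curve.append((t, surviving))
        at_risk -= leaving_at[t]         # drop everyone who left at time t
    return curve


# Hypothetical cohort of 8 recipients (illustration only).
weeks = [12, 15, 15, 20, 24, 30, 30, 30]
tall = [True, True, False, True, True, False, False, False]
curve = kaplan_meier(weeks, tall)
# Cumulative incidence is the complement of the disease-free fraction.
incidence = [(t, round(1 - s, 3)) for t, s in curve]
```

Censored animals (for example, mice analysed before disease onset) leave the risk set without lowering the curve, which is why the estimator is preferred over a raw fraction of affected mice.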
Figure 2 | T-ALL emergence and its prevention by bone marrow
reconstitution. a, Kaplan-Meier plot of T-ALL incidence after grafting of
wild-type thymi (as in Fig. 1a) into indicated recipients, and B6/Ly5.1 and
F1(WB-Kit+/+ × B6/Ly5.1) controls (n are indicated). b, Three groups of
Rag2−/−γc−/−KitW/Wv mice received first a wild-type thymus transplant, and 1,
6, or 10 weeks later were reconstituted with wild-type bone marrow (BM) cells
(Extended Data Fig. 4). The groups were followed for up to 32 weeks. The
yellow curve summarizes the data for the observation period of 30 weeks for all
analysed Rag2−/−γc−/−KitW/Wv mice not receiving bone marrow after thymus
transplantation.
If lack of replacement of intrathymic progenitors by competitive progenitors
from the bone marrow was responsible for T-ALL development,
restoration with competitive bone marrow progenitors should
prevent T-ALL. We reconstituted thymus-grafted Rag2−/−γc−/−KitW/Wv
mice (Fig. 1a) with wild-type bone marrow at different time points (1, 6,
or 10 weeks) after thymus grafting (Extended Data Fig. 4a). These recipients
accept bone marrow without prior myeloablation”, avoiding
effects that irradiation itself would have on leukaemia development. 
Only mice reconstituted with bone marrow 1 week after thymus trans- 
plantation were protected from T-ALL (Fig. 2b). Collectively, disease 
could be prevented by early restoration of normal competitive bone 
marrow, but by 6 weeks after onset of thymus autonomy transformation 
became irreversible (Extended Data Fig. 4b). 


Molecular characterization of T-ALL 


To monitor the emergence of T-ALL from the thymus, we transplanted 
reporter thymus harbouring a green fluorescent protein (GFP) transgene
driven by the Rag2 promoter (RAG2p-GFP)’’, which allows visualization
of thymic emigrants. At 17 weeks, GFP+CD4lo/+CD8+ cells
appeared in the blood (Extended Data Fig. 5a). One week later, this
recipient showed clinical signs of disease, and leukaemic cells had dis- 
seminated into the bone marrow and spleen (Extended Data Fig. 5b). 
Based on this reporter, and on reverse transcription followed by PCR 
(RT-PCR) analysis (not shown), T-ALL continued to express Rag2 
outside the thymus, and after transfer into nude recipients (Extended 
Data Fig. 2a). 

To analyse the impact of Rag expression, which has been linked to 
oncogenic translocations’’, on leukaemia incidence (Fig. 3a, b) and on 
genomic lesions (Fig. 3c, d), we grafted Rag1−/− TCR transgenic tg(OT-I)
thymi into Rag2−/−γc−/−KitW/Wv mice. On a Rag-deficient background,
the transgenic TCR promotes differentiation beyond CD3−CD4−CD8−
triple-negative (TN) stages. T-ALL development in Rag1−/− tg(OT-I)
thymus grafts (Fig. 3a, b) excluded Rag expression as a prerequisite for
leukaemia. Developmental progression beyond TN stages, a scenario
consistent with T-ALL phenotypes (Fig. 1, Extended Data Fig. 1a), was
also not required for transformation because Rag1−/− progenitors lacking
the transgenic TCR developed T-ALL (Fig. 3a, b). Collectively, T-ALL
had an immature CD8 single-positive (ISP)/DP-like surface phenotype, 
but transformation did not require progression beyond TN stages. 

In human and mouse T-ALL, genomic gains and losses reflect geno- 
mic instability’? **. Indeed, array comparative genomic hybridization 
(aCGH) of T-ALL revealed lesions (Fig. 3c) in genes known to be in- 
volved in cancer (Supplementary Tables 1 and 2). Hence, progenitors, 
once relieved from cell competition, become genomically unstable. Rag1
expression contributed to illegitimate or enhanced DNA recombination
because lesion numbers were lower in Rag1−/− T-ALL (Fig. 3d).
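
The Fig. 3 legend names the Mann-Whitney-Wilcoxon test for this lesion-count comparison. As a hedged sketch of what that test computes, the following Python derives the U statistic from midranks. The lesion counts are hypothetical placeholders (only the group sizes, n = 7 and n = 5, follow the legend), and the function name is an assumption.

```python
def mann_whitney_u(x, y):
    """Return (U_x, U_y) for two independent samples, using midranks for ties."""
    # Pool both samples, remembering which group each value came from.
    pooled = sorted((v, g) for g, grp in enumerate((x, y)) for v in grp)
    ranks = [0.0] * len(pooled)
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j][0] == pooled[i][0]:
            j += 1
        midrank = (i + 1 + j) / 2        # mean of 1-based ranks i+1 .. j
        for k in range(i, j):
            ranks[k] = midrank
        i = j
    rank_sum_x = sum(r for r, (_, g) in zip(ranks, pooled) if g == 0)
    u_x = rank_sum_x - len(x) * (len(x) + 1) / 2
    u_y = len(x) * len(y) - u_x
    return u_x, u_y


# Hypothetical lesion counts per T-ALL sample (illustration only).
rag1_wt = [9, 11, 12, 14, 15, 17, 20]    # n = 7
rag1_ko = [2, 3, 3, 5, 6]                # n = 5
u_wt, u_ko = mann_whitney_u(rag1_wt, rag1_ko)
```

The smaller of the two U values is compared against the null distribution; a value of 0 means the two groups do not overlap at all.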

Typical for T-ALL, gain-of-function mutations in Notch1 affect the heterodimerization
(HD), transactivation (TAD) and PEST domains’**?°
(Fig. 3e), resulting in ligand-independent signalling, or greater intra- 
cellular protein stability**”’. Indeed, we found at least one heterozygous 
mutation in 22/28 T-ALL (triangles in Fig. 3e). Affected domains, DNA 
mutation types and putative functional consequences (Extended Data 
Fig. 6a) were similar to mutations in human T-ALL™. In murine mod- 
els, 5’ genomic deletions in Notch1 drive alternative transcripts, result- 
ing in active intracellular Notch1***°. Such 5’ deletions were detected 
in 13/25 Rag1+/+ mice but in none of the 5 Rag1−/− T-ALL (Extended
Data Fig. 6b, c). Notch1 was expressed from a 5’ alternative promoter” 
in T-ALL but not in normal thymus (Extended Data Fig. 6d). Collectively, 
79% of T-ALL showed gain-of-function mutations (Rag1-independent),
and 52% had 5’ deletions (Rag1-dependent) in the Notch1 gene (Fig. 3f). 


Intrathymic origin of T-ALL and self-renewal 


Leukaemia cells are considered developmentally ‘frozen’, and their phe- 
notype is thought to reflect their stage of origin. We used global gene 
expression profiling to compare T-ALL samples (n = 10) with cells from 
different stages of T-cell development, ranging from early thymic 
Figure 3 | Molecular characterization of T-ALL. a, Thymi from
newborn Rag1−/− tg(OT-I) or Rag1−/− donors were transplanted into
Rag2−/−γc−/−KitW/Wv mice (as in Fig. 1a). 8 weeks later, peripheral blood was
analysed for T cells (upper row). At the onset of T-ALL, spleens were analysed
(lower row). B6/Ly5.1 mice served as staining controls. Data are representative
for T-ALL+ mice indicated. b, Kaplan-Meier plots of T-ALL development in
Rag2−/−γc−/−KitW/Wv recipients of wild-type, Rag1−/− tg(OT-I), or Rag1−/−
thymus grafts. None of the curves in b were significantly different. c, Rag1+/+
(n = 7) and Rag1−/− tg(OT-I) (n = 5) T-ALL samples were analysed by aCGH.
Fractions of mice with gains (green) and losses (red) are shown by
chromosomal positions. d, Number of lesions in T-ALL, excluding trisomy 15,
were significantly higher in Rag1+/+ compared to Rag1−/− tg(OT-I) T-ALL
(Mann-Whitney-Wilcoxon test). e, HD, TAD and PEST domains of Notch1
are depicted. Each triangle corresponds to one mutation. Each colour
represents a single T-ALL (see Extended Data Fig. 6a). f, Frequencies of Notch
mutations (exon 26/27/35; triangles in e), and deletions (Δ5′) (see Extended
Data Fig. 6b, c) in T-ALL.


progenitor (ETP), to TN CD44+CD25+ (TN2), to CD44−CD25+ (TN3),
to CD44−CD25− (TN4), to ISP, to DP cells. Principal component analysis
(PCA) revealed the closest relationship of T-ALL to TN3/TN4/ISP
stages (Fig. 4a), indicating that upon progenitor deprivation stages
from TN3 to ISP enter self-renewal, and eventually undergo malignant
transformation. Self-renewal was analysed by intrathymic injection
of TN cells (containing TN1-4), or more mature DP/SP positive
cells, each sorted from 10-week-old autonomous thymus grafts (Fig. 4b).
Whereas DP/SP donor cells retained their phenotype with no evidence 
for further development, and limited expansion in vivo (~ fivefold), TN 
donor cells gave rise to the major downstream stages (DP and SP), and their 
expansion was ~ 500-fold. The data imply that the self-renewing cells 
that maintain the residual thymus cellularity and T-cell development 
Figure 4 | Origin of T-ALL, self-renewal in the thymus, and transcriptome
analyses during disease progression. a, PCA for global gene expression of
T-ALL samples (n = 10; appearing as one dot), and stages of T-cell
development, indicated by unique colours for each population (n = 2) (see
text). PC1 reveals the highest variance between leukaemic samples and
thymocyte subpopulations. PC2 reflects progression along T-cell development,
revealing the highest association of T-ALL to TN3/TN4/ISP stages.
b, CD3−CD4−CD8− TN, or CD4+CD8+ DP together with CD4+CD8− and
CD4−CD8+ SP populations were sorted from 10-week progenitor-deprived
thymi (n = 7) (left), reanalysed (middle), and injected intrathymically into
irradiated recipients. Phenotypes of progeny from TN-injected (n = 3), or
DP/SP-injected (n = 2) recipients were analysed on day 21 (right). Numbers
above the panels are cell numbers injected (input), and retrieved (output) per
mouse. c, PCA of transcriptomes revealed segregation by organ (thymus,
spleen) and by condition (normal thymus, progenitor-deprived thymus grafts,
T-ALL). Rag1+/+ or Rag1−/− T-ALL are depicted in blue or red dots.
d-f, Three pairwise comparisons were overlaid in a Venn diagram (Extended
Data Fig. 7), revealing unique and common transcriptional changes. The heat
maps show (threshold = fourfold) changes unique for the transition from
the normal thymus (n = 2) to thymus grafts (n = 3) (d), changes maintained
into the T-ALL stage (n = 7) (e), and changes exclusive for T-ALL (f).


are contained in the TN compartment. It is difficult to distinguish self- 
renewal in non-transformed T-cell progenitors from pre-malignant 
behaviour of cells with retained differentiation potential. Nevertheless, 
these data are consistent with self-renewal in the TN compartment 
from which T-ALL arise (Fig. 4a). 
Pre-malignant and leukaemic gene expression

PCA of gene expression in normal thymus with regular progenitor turnover,
in thymus devoid of extrinsic progenitor competition for 10 weeks,
and in fully malignant T-ALL (Fig. 4c-f) revealed sample clustering 
according to experimental groups, demonstrating consistent gene expres- 
sion changes during deprivation and subsequent transformation (Fig. 4c). 
Three pairwise comparisons (see Extended Data Fig. 7 for rationale of 
analysis) revealed transcriptional changes (threshold = fourfold) unique 
for the transition from the normal to the progenitor-deprived thymus 
(Fig. 4d), or maintained into the T-ALL stage (Fig. 4e). Changes exclusive
for T-ALL (Fig. 4f) included many cancer- and leukaemia-associated
genes (see Supplementary Discussion), notably Lmo2 and Tal1 (Fig. 4f).
Lowering the threshold to ≥ two-fold differential expression, and focusing
on cancer-associated genes, uncovered additional information (Supplementary
Tables 3-5). Ikaros (also known as Ikzf1), Pten and Rb1
expression was reduced already in thymus grafts (Supplementary Table 3),
whereas Lyl1 and Myc expression increased only in T-ALL (Supplementary
Table 5). In summary, the gene expression profile in T-ALL was, in
part, already initiated in the thymus upon progenitor deprivation. 
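
The three-way comparison described above can be illustrated with a small fold-change classifier. This is only a sketch under stated assumptions: the exact overlay logic is given in Extended Data Fig. 7, which is not reproduced here, so the category rules below are an interpretation. The expression values and gene labels are hypothetical, and the function names are not from the study.

```python
from math import log2


def changed(a, b, fold=4.0):
    """True if expression differs by at least `fold` in either direction.

    Assumes strictly positive expression values.
    """
    return abs(log2(b / a)) >= log2(fold)


def classify(normal, graft, tall, fold=4.0):
    """Assign a gene to one of the three change classes (assumed rules)."""
    normal_vs_graft = changed(normal, graft, fold)
    normal_vs_tall = changed(normal, tall, fold)
    graft_vs_tall = changed(graft, tall, fold)
    if normal_vs_graft and normal_vs_tall and not graft_vs_tall:
        return "maintained into T-ALL"          # compare Fig. 4e
    if normal_vs_graft and not normal_vs_tall:
        return "unique to progenitor deprivation"  # compare Fig. 4d
    if not normal_vs_graft and normal_vs_tall and graft_vs_tall:
        return "exclusive for T-ALL"            # compare Fig. 4f
    return "other"


# Hypothetical expression values: (normal thymus, thymus graft, T-ALL).
genes = {
    "geneA": (10, 80, 90),
    "geneB": (10, 50, 12),
    "geneC": (10, 12, 100),
    "geneD": (10, 11, 9),
}
classes = {name: classify(*values) for name, values in genes.items()}
```

Passing `fold=2.0` corresponds to the lowered threshold mentioned in the text.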
Gene expression data (Fig. 4f) were used to correlate this murine T- 
ALL to subgroups of human T-ALL*’. Spearman rank correlation coef- 
ficients revealed significant correlations between our T-ALL and all human 
T-ALL subtypes, but not to human B-ALL” (Extended Data Fig. 8a, b), 
excluding the possibility that correlations between murine and human 
T-ALL reflect a ‘transformed state’. Important oncogenes in human 
T-ALL include TAL1, LMO2, TLX1 and LYL1. Tal1 and Lmo2 were
strongly upregulated in the murine T-ALL, Lyl1 was also significantly
but more weakly expressed whereas Tlx1 was not expressed (Extended
Data Fig. 8c). Hence, similar oncogenic pathways were found in T-ALL
arising in the bone-marrow-progenitor-deprived thymus compared to
human subgroups of T-ALL, but not B-ALL. The homologues of the
most commonly found oncogenes in human T-ALL, Tal1 and Lmo2,
were strongly induced in the murine T-ALL.
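
Spearman rank correlation, used above to compare murine and human T-ALL expression profiles, is the Pearson correlation computed on ranks. A minimal sketch follows. The expression values are hypothetical, and the helper names `midranks` and `spearman_rho` are assumptions, not part of the study's analysis code.

```python
from math import sqrt


def midranks(values):
    """1-based ranks, with tied values sharing the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j < len(order) and values[order[j]] == values[order[i]]:
            j += 1
        for k in range(i, j):
            ranks[order[k]] = (i + 1 + j) / 2
        i = j
    return ranks


def spearman_rho(x, y):
    """Spearman coefficient as the Pearson correlation of the midranks."""
    rx, ry = midranks(x), midranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / sqrt(var_x * var_y)


# Hypothetical expression of five orthologous genes (illustration only).
mouse_tall = [8.1, 2.3, 5.0, 9.4, 1.2]
human_tall = [7.0, 3.1, 2.9, 8.8, 4.0]
rho = spearman_rho(mouse_tall, human_tall)
```

Because only ranks enter the calculation, the coefficient is insensitive to platform-specific scaling between the mouse and human data sets.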


Hallmarks of cell competition driving thymocyte 
turnover 

Thymus autonomy*” and subsequent T-ALL development occur when 
bone marrow progenitors are unresponsive to IL-7, whereas intrathy- 
mic progenitors receive IL-7 receptor signals (Fig. 2). We constructed 
mixed chimaeras composed of titrated numbers of wild-type (γc+/+)
and IL-7r-deficient (γc−/−) donor bone marrow, which revealed the
sensitivity and stage (TN2/TN3) of IL-7-driven selection of responsive 
over unresponsive precursors (Extended Data Fig. 9). 

Kinetic analysis of the physiological replacement of thymus-resident 
by bone-marrow-derived progenitors (both wild type) revealed day 21 
as the ‘turning point’ for total thymocytes (Fig. 5a), and day 14 for TN2/
TN3 cells (Fig. 5e). The turnover was completed by day 35 (Fig. 5b, f). 
By comparison, when intrathymic progenitors (wild type) lacked competition
from bone marrow progenitors (Rag2−/−γc−/−KitW/Wv hosts),
both total thymocytes (Fig. 5c) and TN2/TN3 cells (Fig. 5g) persisted 
beyond day 35. The fact that one cell population (thymus-resident 
progenitors) was lost in the presence, but not in the absence of another 
population (bone-marrow-derived progenitors), indicates that normal 
T-cell progenitor turnover is regulated by cell competition. The extent 
of the competition-dependent cell loss is illustrated by the shaded areas 
under the curves (Fig. 5d, h). 
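
One way to put a number on the shaded areas is to integrate the gap between the persistence curves with and without competition. The sketch below uses the trapezoidal rule. It assumes the shaded area is the area between the two curves, which the text does not state explicitly, and all time points and cell fractions are hypothetical.

```python
def trapezoid(xs, ys):
    """Area under a piecewise-linear curve through the points (xs, ys)."""
    return sum(
        (x1 - x0) * (y0 + y1) / 2
        for x0, x1, y0, y1 in zip(xs, xs[1:], ys, ys[1:])
    )


# Hypothetical thymus-resident cells, as % of the day-0 value.
days = [0, 7, 14, 21, 28, 35]
with_competition = [100, 90, 60, 30, 10, 0]
without_competition = [100, 95, 85, 80, 75, 70]

# Competition-dependent loss: area between the two curves (% x days).
loss = trapezoid(days, without_competition) - trapezoid(days, with_competition)
```

With these made-up values the result is in units of percent-days; only comparisons between conditions measured on the same scale are meaningful.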

Thymus-resident and bone-marrow-derived TN2/TN3 cells differ 
in their time of residency in the thymus (Fig. 5e), and therefore in the 
duration for which they have been engaged in T-cell development. We 
therefore operationally refer to thymus-resident TN2/TN3 cells as 
were overlaid in a Venn diagram (Extended Data
Fig. 10a). Including additionally the comparison
of control TN2 versus TN3 cells allowed exclusion
of changes associated with the TN2-to-TN3
transition. This yielded lists of competition-responsive
genes (j), genes specific for old cells
in the absence of competition (k) and genes
associated with old cells regardless of
competition (l).
‘old’, and to bone-marrow-derived TN2/TN3 cells as ‘young’. We
analysed global gene expression for old TN2/TN3 cells that were 
under competition from young TN2/TN3 cells (as in Fig. 5f) or not 
(as in Fig. 5g), and for young TN2/TN3 cells. PCA revealed distinct 
clusters for young and for old TN2/TN3, whereas the clusters for old 
TN2/TN3 with or without competition overlapped (Fig. 5i). Pairwise 
comparisons (Extended Data Fig. 10a) revealed the impact of competition
on old TN2/TN3 cells. Using a threshold of ≥ twofold, we
identified 9 genes with specifically altered expression in old TN2/TN3
cells under competition (Fig. 5j). Interestingly, the anti-apoptotic gene
Bcl2 was among these genes, and this translated into reduced expression
of the Bcl2 protein. Consistently, frequencies of annexin+ cells
were increased in old compared to young TN2/TN3 cells (Extended
Data Fig. 10b-d). Because Bcl2 expression is downstream from IL-7r°’,
these data suggest a link between the natural replacement process and 
the observed effects of IL-7r signalling on competition®’. The func- 
tional implications of other identified genes, provisionally referred to 
as ‘competition-regulated genes’ (Fig. 5j), will require further studies. 

Changes specific for old TN2/TN3 cells in the absence of competi- 
tion were also identified (Fig. 5k). Remarkably, the most upregulated 
gene was high-mobility group A1 (Hmga1), a transcriptional master regulator
overexpressed in stem cells and in many cancer types, including
human T-ALL**”. Finally, we also identified 19 genes that were differentially
expressed comparing young and old T-cell progenitors, regardless
of competition (Fig. 5l).


Discussion 


Our findings identify cell competition as a mechanism driving the phys- 
iological substitution of thymus-resident progenitors (‘old’) by bone- 
marrow-derived (‘young’) progenitors. We have characterized this dynamic 
process in kinetic and quantitative terms. The results demonstrate that 
old and young progenitors, although genetically identical and pheno- 
typically positioned at the same developmental stage, differ selectively 
in gene expression, which may reflect their distinct past and current ‘cel- 
lular experiences’, such as dwell times in the thymus, cumulative pro- 
liferation numbers, cellular or genetic insults suffered during V(D)J 
recombination and DNA repair, growth factor dependency, stromal 
niche support, and others. Levels of Bcl2 were reduced in old compared 
to young progenitors, which was consistent with higher frequencies of 
apoptotic cells in the old compared to the young. Because Bcl2 express- 
ion is downstream of IL-7r signalling, lower Bcl2 expression suggests 
that IL-7 availability is limited for, or less utilized by, old progenitors. 
This provides an attractive link between the experiments in which IL-7- 
unresponsive bone marrow progenitors could not inhibit thymus auto- 
nomy*” and natural cell competition, in which both thymus and bone 
marrow progenitors are wild type. We took advantage of mice lacking 
T-cell progenitors in the bone marrow to relieve old wild-type progen- 
itors in the thymus from competition. In such ‘unopposed’ old pro- 
genitors, we identified several genes that partially resemble the ‘young 
state’ (Fig. 5j), and others that distinguish specifically old progenitors 
with competition from those without (Fig. 5k). Hence, old progenitors 
seem to sense and react to competition by young progenitors. 

The consequences of disrupted cell competition on intrathymic T- 
cell progenitors are of particular interest. Already after 14 days without 
competition, old cells strongly upregulate Hmga1, a key regulator of
stemness that is highly expressed in many cancers. Hmga1 encodes non- 
histone chromatin proteins that act as architectural transcriptional fac- 
tors. These not only regulate diverse developmental processes but also 
orchestrate malignant transformation. Hmga1 transgenic mice develop
aggressive T-ALL that is accelerated by loss of Cdkn2a***, one of the loci 
affected in our model (Supplementary Table 2). At the cellular level, 
progenitor deprivation enables self-renewal at TN stages, as shown by 
TN2/TN3 persistence in the absence of competition, and the fact that 
even after 10 weeks without new progenitor import TN cells gave rise to 
DP and SP stages upon adoptive transfer. By this time, many genes are 
deregulated compared to a normal thymus, with some changes carried 
over into T-ALL. On the basis of clinical, pathological, cellular and
molecular criteria, T-ALL developing in mice by disruption of cell com- 
petition in the thymus share many properties with human T-ALL. 
Collectively, we propose that cell competition protects the T-cell pro- 
genitor population from in situ ageing, and in this manner is a tumour- 
suppressor mechanism. 

Promotion of leukaemia by thymus autonomy could explain T-ALL 
in X-linked SCID patients in the wake of gene therapy. X-linked SCID 
patients carry germline mutations in γc, blocking early T-cell develop-
ment in the human thymus. Following transplantation of autologous,
gene-corrected stem or progenitor cells without myeloablation, all T cells,
but no myeloid or B cells, were derived from γc+ transplanted cells**. Vector
integration near Lmo2 has been held responsible for T-ALL that devel- 
oped in 5/20 treated patients**”’, but leukaemias also harboured wide- 
spread genomic abnormalities unrelated to the viral integration”. 
Given the sustained T-cell development in the patients’ thymus, with 
no signs of bone marrow engraftment, and no evidence for T-cell pro-
genitor replenishment, γc+ progenitors probably only colonized the
thymus where they prevailed long-term over γc− (endogenous) progeni-
tors. Our transplantation of wild-type thymus (containing competent
γc+ progenitors) into γc−/− mice closely reflects this situation. In con-
trast, none of the patients receiving gene therapy to correct adenosine
deaminase (ADA) deficiency developed leukaemia, and all were pre- 
conditioned to permit HSC engraftment. We conclude that lack of 
progenitor supply from the bone marrow, combined with autonomous 
T-cell development in the thymus, favours progenitor transformation. 
Similarly, mutations in mice associated with development of leukaemia, 
notably Lmo2 overexpression” or loss of Tcf1 (also known as Hnf1a)”,
may force T-cell progenitors into self-renewal and enhance their com- 
petitiveness. This enhanced competitiveness then prevents normal cell 
turnover and promotes transformation. Altogether, the findings reported 
here for the thymus link defects in cell competition to tumorigenesis. It 
might be interesting to apply this framework to cancers arising in other 
cells or tissues maintained by stem and progenitor reservoirs. 


METHODS SUMMARY 


All mouse experiments were approved by Regierungspräsidien Tübingen or Karlsruhe.
Recipient mice were C57BL/6, Rag2−/−γc−/−KitW/Wv, KitW/Wv, Rag2−/−γc−/−,
γc−/−, Il7ra−/−, Rag1−/−, Rag1−/− tg(OT-I) (all CD45.2+). Thymus donors were
B6/Ly5.1 (H-2b; CD45.1+), F1(WB-KitW/+ × B6/Ly5.1), F1(WB-KitW/+ × B6/Ly5.1-
RAG2p-GFP transgenic), or Rag1−/−, or Rag1−/− tg(OT-I) mice. Survival curves and
statistical analyses were calculated with Prism 5. For transcriptome analysis RNA 
was isolated using RNAzol B (IsoTex Diagnostics). Labelled complementary RNA 
was hybridized to Illumina MouseWG-6 v2.0 BeadChips. Data were analysed with 
Chipster v2.9. For aCGH, labelled DNA was hybridized to Agilent Mouse Genome 
aCGH Microarrays. Analyses were carried out using R 2.15/Bioconductor 2.11 
with add-on packages limma, aCGH, snapCGH, DNAcopy and cghMCR. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS

Mice. Recipient mice were C57BL/6, Rag2−/−γc−/−KitW/Wv, KitW/Wv, Rag2−/−γc−/−,
γc−/−, Il7ra−/−, Rag1−/−, Rag1−/− tg(OT-I) (all CD45.2+) as described.
Histocompatible thymus donors were B6.SJL-Ptprca Pepcb/BoyJ (H-2b; CD45.1+)
(termed B6/Ly5.1), or F1(WB-KitW/+ × B6/Ly5.1), or F1(WB-KitW/+ × B6/Ly5.1-
RAG2p-GFP transgenic), or Rag1−/−, or Rag1−/− tg(OT-I) mice. Both male and
female mice were used at ages ranging from 6 to 20 weeks. No randomization and
no blinding was used. All mouse experiments were approved by the Regierungspräsidien
Tübingen or Karlsruhe.

Thymus transplantation and recipient analyses. Two thymus lobes were grafted 
as described* at the poles of the kidney of one recipient. Group sizes of approxi- 
mately ten mice were used to detect leukaemia incidence rates with sufficient reli- 
ability. Assuming an incidence rate of 50% and allowing for a confidence interval 
width of 50%, that is, 50% ± 25%, 10 mice were required to control the type I error
rate at 10%. With group sizes of at least 10 mice, we were able to detect a 4.5-fold 
increased risk as well as a 90% risk reduction for time to T-ALL development with 
80% power in a two-group comparison based on the log-rank test, assuming a 
median time of 30 weeks in the standard group and a type I error rate of 5%. In the 
case of Rag1−/− recipient mice, only n = 5 were used (Fig. 2) in view of the fact that
these hosts suppress thymus autonomy, which is a prerequisite for T-ALL develop-
ment. Although the n = 8 for γc−/− recipients is slightly below 10, the combined
evidence obtained together with data from Il7ra−/− recipients (n = 12) under-
scores the role of the Il7r in cell competition. Group sizes for control mice were
greater than n = 30 to exclude even rare incidence of leukaemia in these strains. 
Survival curves were calculated with Prism 5. 
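
The group-size reasoning above can be retraced numerically. The following Python sketch assumes a normal-approximation (Wald) interval for the incidence estimate and Schoenfeld's approximation for the number of events needed by a two-group log-rank test; neither formula is named in the text, and the function names are illustrative only.

```python
import math
from statistics import NormalDist


def binomial_ci_half_width(p: float, n: int, alpha: float) -> float:
    """Half-width of a two-sided normal-approximation interval for a proportion."""
    z = NormalDist().inv_cdf(1 - alpha / 2)
    return z * math.sqrt(p * (1 - p) / n)


def schoenfeld_events(hazard_ratio: float, alpha: float = 0.05, power: float = 0.80) -> float:
    """Approximate number of events a two-sided log-rank test needs
    (equal group sizes, Schoenfeld's formula)."""
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_beta = NormalDist().inv_cdf(power)
    return 4 * (z_alpha + z_beta) ** 2 / math.log(hazard_ratio) ** 2


# Incidence of 50%, 10 mice, type I error rate of 10%
half_width = binomial_ci_half_width(p=0.5, n=10, alpha=0.10)

# 4.5-fold increased risk and 90% risk reduction, 80% power, 5% type I error
events_for_increase = schoenfeld_events(4.5)
events_for_reduction = schoenfeld_events(0.1)
```

Under these assumptions, ten mice at 50% incidence give a half-width of roughly 0.26, close to the ± 25% quoted. The event counts are approximations and do not account for censoring or follow-up time.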

Flow cytometry. Organs were harvested and single-cell suspensions prepared in 
PBS/5%FCS. Cells were blocked with 100 μg ml−1 mouse IgG (Jackson Immuno-
Research) for 15 min and stained for 30 min in an appropriately diluted antibody
staining solution in PBS/5%FCS. Reagents used were CD3 allophycocyanin (APC)- 
Cy7 (17A2), CD4 phycoerythrin (PE)-Cy7 (GK1.5), CD8 APC (53-6.7), CD44 
PerCP-Cy5.5 (IM7), CD45.1 PE (A20), CD45.1 PE-Cy7 (A20) (eBioscience), CD3 
PE (145-2C11), CD4 PE, CD8 PE (53-6.7), CD11c PE (HL3), CD19 PE (1D3), CD25 
PE (PC61), CD45.1 bio (A20), CD45.1 FITC (A20), CD45.2 PerCP-Cy5.5 (104), 
CD117 APC (2B8), NK1.1 PE (PK136), Gr1 PE (RB6-8C5), Ter119 PE (Ter119),
Ki67 Alexa Fluor 647 (B56), Bcl2 PE (clone 100) (BD Pharmingen), CD11b PE (M1/
70.15), Sca1 PE-Cy5.5 (Ly-6A/E), CD4 QDot605 (RM4-5) and Streptavidin QDot605
(Invitrogen/Molecular Probes). For definition of HSC, the lineage cocktail (Lin)
was composed of CD3, CD4, CD8, CD11b, CD11c, Gr1, CD19, NK1.1, and Ter119.
In the thymus, sorted populations were defined as follows: ETP (Lin−CD44+
CD25−Kit+), TN2 (Lin−CD44+CD25+), TN3a (CD4−CD8−CD3−CD44−CD25+
CD28−), TN3b (CD4−CD8−CD3−CD44−CD25+CD28+), TN4 (CD4−CD8−
CD3−CD44−CD25−CD28+), ISP (CD4−CD8+CD24+TCRβ−), DP (CD4+CD8+).
For staining of blood cells, blood was collected from the tail vein directly into 
potassium-EDTA containing tubes (Sarstedt), and 500 μl of PBS/5%FCS were
added to dilute the blood. FICOLL-Paque PLUS (GE Healthcare) was added below 
the cell suspension. Samples were centrifuged at 250g for 20 min at room temper- 
ature, the interface was collected, and cells were washed with PBS/5%FCS. Staining 
was performed as indicated above. Annexin staining (Annexin V Apoptosis Detec-
tion Kit, eBioscience) was performed according to the manufacturer's instructions,
with the antibodies used for the extracellular staining combined with
annexin V in a one-step staining.
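
The marker-based definitions of the sorted thymic populations can be expressed as a simple lookup. The Python sketch below is illustrative only: the positive/negative assignments are transcribed from the definitions in this section (where the superscripts are partly illegible and therefore an assumption), only a subset of populations is included, and `matching_populations` is a hypothetical helper, not part of any analysis software used here.

```python
# Marker profiles: True = positive, False = negative.
# Signs follow the population definitions in the text and are assumptions
# wherever the printed superscripts could not be read unambiguously.
POPULATIONS = {
    "TN2": {"Lin": False, "CD44": True, "CD25": True},
    "TN3a": {"CD4": False, "CD8": False, "CD3": False,
             "CD44": False, "CD25": True, "CD28": False},
    "TN3b": {"CD4": False, "CD8": False, "CD3": False,
             "CD44": False, "CD25": True, "CD28": True},
    "TN4": {"CD4": False, "CD8": False, "CD3": False,
            "CD44": False, "CD25": False, "CD28": True},
    "DP": {"CD4": True, "CD8": True},
}


def matching_populations(cell: dict) -> list:
    """Return every population whose listed markers all match the cell."""
    return [
        name
        for name, profile in POPULATIONS.items()
        if all(cell.get(marker) == state for marker, state in profile.items())
    ]


# A hypothetical cell that is negative for CD4, CD8, CD3 and CD44,
# and positive for CD25 and CD28
example_cell = {"CD4": False, "CD8": False, "CD3": False,
                "CD44": False, "CD25": True, "CD28": True}
example_labels = matching_populations(example_cell)
```

Real gating works on continuous fluorescence intensities with manually set thresholds; the boolean profile here only makes the logic of the definitions explicit.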

Magnetic resonance imaging (MRI). Ten Foxn1nu/nu (nude) mice with T-ALL and
two healthy control nude mice were killed and immediately analysed in a dedicated
high-field small-animal MRI system (BioSpec 117/16, Bruker Biospin, Ettlingen,
Germany) with a 4 cm quadrature volume resonator in coronal slice orientation.
The MRI protocol comprised a multi-slice T1-weighted RARE sequence
with the following acquisition parameters: TE/TR = 16 ms/600 ms, RARE factor = 8, Δx/
Δy/Δz = 150/150/1,000 μm³ with an additional inter-slice gap of 400 μm. For
volume quantification the data were transferred to research software (Segment,
MEDVISO AB, Lund, Sweden) and the spleen was manually contoured.
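
As a rough illustration of how a contoured volume follows from the acquisition geometry, the Python sketch below multiplies the number of contoured voxels by the voxel dimensions given above. Whether the inter-slice gap is counted towards the volume is an assumption exposed as a flag; the text does not state how the segmentation software handles it, and the voxel counts are invented for the example.

```python
def contoured_volume_mm3(voxels_per_slice, dx_mm=0.15, dy_mm=0.15,
                         dz_mm=1.0, gap_mm=0.4, include_gap=True):
    """Volume of a manually contoured organ from per-slice voxel counts.

    dx, dy, dz follow the stated resolution (150/150/1,000 micrometres);
    gap_mm is the stated inter-slice gap (400 micrometres).
    """
    slice_spacing = dz_mm + (gap_mm if include_gap else 0.0)
    return sum(voxels_per_slice) * dx_mm * dy_mm * slice_spacing


# Hypothetical contour: three slices with 1,000, 2,000 and 1,000 voxels
volume_with_gap = contoured_volume_mm3([1000, 2000, 1000])
volume_without_gap = contoured_volume_mm3([1000, 2000, 1000], include_gap=False)
```

Including the gap treats each slice as representative of the tissue up to the next slice, which is a common convention but should be checked against the software actually used.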

Histology and immunohistochemistry. Cryosections (5 μm) of spleen and liver
of T-ALL+ recipients, and healthy controls were air-dried, and either immedi-
ately used for haematoxylin and eosin (H&E) staining or dehydrated in acetone 
for 8 min, air dried, and stored at —80 °C for subsequent immunohistochemistry. 
H&E sections were stained for 8 min in haematoxylin (Merck), followed by rinse 
and wash for 5 min in tap water, and further stained in 0.25% eosin (Sigma) for 
3 min. Slides were dipped into 75% ethanol to remove excess of eosin, kept for 
1 min in 96% ethanol, 3 min in 99% ethanol (twice), 5 min in xylol (twice), and 
mounted with Entellan (Merck). For the immunohistochemistry of the liver, 
slides were rehydrated for 5 min in PBS and blocked in PBS 0.2% Tween 1% 
BSA 10% normal goat serum (Chemicon-Millipore) for 1 h at room temperature. 
Staining was performed in blocking buffer with diluted purified anti-CD8 antibody 
(53-6.7, Biolegend) overnight at 4 °C, followed by 3 washes in PBS 0.2% Tween, 
and detection with goat anti-rat Cy3 (Biolegend) for 1 h at room temperature. Stained
sections were washed, dried and mounted with Fluoromount G (Southern Biotech)
containing 1.5 μg ml−1 DAPI (Roth). For immunohistochemistry of spleen sections,
slides were rehydrated and blocked at room temperature for 30 min in PBS 2% BSA
0.1% NaN3 100 μg ml−1 mouse IgG (Jackson ImmunoResearch). Staining was per-
formed in PBS 2% BSA 0.1% NaN3 with diluted biotinylated CD4 (GK1.5), CD8
(53-6.7), B220 FITC (RA3-6B2) and Ter119 APC (Ter-119) for 1 h at room tem-
perature in a wet chamber, followed by rinse and wash in PBS for 10 min. Streptavidin
Alexa594 staining followed for 1 h at room temperature in a wet chamber. Slides were
washed in PBS for 10 min, and stained in 1 mol 4',6-diamidino-2-phenylindole
(DAPI) in PBS for 10 min at room temperature. Stained sections were washed, and 
mounted with Fluoromount G (Southern Biotech). 

Genomic DNA and array-CGH (aCGH). For preparation of genomic DNA, 
5 X 10° splenocytes were spun down, digested with proteinase K, and genomic 
DNA was purified with phenol-chloroform, and diluted in water. The DNA inte- 
grity was assessed by agarose gel electrophoresis and 2 μg DNA were used for
digestion with AluI and RsaI. The digested DNA was labelled with Cy5 (tumour
DNA) or Cy3 (control DNA) with Genomic Enzymatic Labelling kit (Agilent Tech- 
nologies), and quantified with a ND-1000 photometer (NanoDrop Technologies). 
Labelled DNA was hybridized to Agilent Mouse Genome aCGH Microarrays (Agilent 
Technologies) according to the manufacturer’s guidelines. All steps following DNA 
purification were performed at Miltenyi Biotec GmbH. 

For aCGH data analyses, raw intensity values were preprocessed using ‘nor- 
mexp’ background correction and loess normalization, with subsequent log2
transformation of the data. Replicated probes were averaged, and probes with 
more than 10% missing values were removed. Missing values were imputed using 
a smoother to account for genomic structure. Circular binary segmentation (CBS) 
was used to detect change points in copy number states and to calculate smoothed 
intensities within segments. Copy number states that had close-by mean intens- 
ities were further merged after segmentation. Thresholds for defining copy num- 
ber changes were set at ±0.6 for log2 smoothed intensities. All analyses were
carried out using R 2.15/Bioconductor 2.11 with add-on packages limma, aCGH, 
snapCGH, DNAcopy and cghMCR. 
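
Two of the steps described above, removing probes with too many missing values and thresholding the smoothed log2 intensities, are simple enough to sketch. The Python code below is a hedged illustration rather than the Bioconductor workflow used here: it does not perform background correction, normalization, imputation or circular binary segmentation, and it assumes that the stated threshold is inclusive.

```python
import math


def filter_probes(rows, max_missing=0.10):
    """Keep probes (rows of per-sample log2 ratios) with at most
    max_missing fraction of missing (NaN) values."""
    kept = []
    for row in rows:
        missing_fraction = sum(math.isnan(v) for v in row) / len(row)
        if missing_fraction <= max_missing:
            kept.append(row)
    return kept


def call_copy_number(segment_means, threshold=0.6):
    """Label each smoothed log2 segment mean as gain (+1), loss (-1) or neutral (0)."""
    calls = []
    for value in segment_means:
        if value >= threshold:
            calls.append(1)
        elif value <= -threshold:
            calls.append(-1)
        else:
            calls.append(0)
    return calls


nan = float("nan")
probes = [
    [0.1, 0.2, nan, 0.0, 0.1, 0.3, 0.2, 0.1, 0.0, 0.1],   # 10% missing: kept
    [0.1, nan, nan, 0.0, 0.1, 0.3, 0.2, 0.1, 0.0, 0.1],   # 20% missing: removed
]
kept_probes = filter_probes(probes)
calls = call_copy_number([0.9, 0.1, -0.75])
```

The segment means passed to `call_copy_number` would come from a segmentation step such as CBS; here they are made-up values.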

RNA, complementary DNA synthesis and RT-PCR. RNA was isolated using 
RNAzol B (IsoTex Diagnostics) according to the manufacturer's recommenda- 
tions and dissolved in RNase-free water. For cDNA synthesis, a DNase I (Invitrogen)
digest step was performed, and cDNA was synthesized with the Transcriptor High Fidelity
kit (Roche) using oligo dT primers. Expression of canonical and alternative Notch1
transcripts, as well as Hes1 and β-actin, was analysed by RT-PCR. For PCR amp-
lification from cDNA, we used the following primers and conditions: Notch1 E1aE6
F: 5'-CCTCTTCCCTTCCAGGTCCATC-3', R: 5'-CAGGCATTGGGCATGAGCTGA-3',
PCR product: ~1,300 bp, annealing temperature 60 °C; Notch1 E1E6 F:
5'-GCGAGCGCAGTGAAGGAACGA-3', R: 5'-CAGGCATTGGGCATGAGCTGA-3',
PCR product: 1,115 bp, annealing temperature 65 °C; β-actin F:
5'-CATCACTATTGGCAACGAGC-3', R: 5'-ACGCAGCTCAGTAACAGTCC-3',
PCR product: 410 bp, annealing temperature 60 °C; Hes1 F:
5'-ATGCCAGCTGATATAATGGAG-3', R: 5'-ACGCTCGGGTCTGTGCTGAGC-3',
PCR product: 317 bp, annealing temperature 60 °C.

Analyses of deletions and mutations in Notch1, and of TCR-β rearrangements.
Genomic DNA was purified as described above. Genomic regions were amplified 
by PCR using the Expand Long Template PCR system (Roche). The amplified bands 
were cut from agarose gels, purified with Geneclean Spin Kit (MP Biomedicals), 
and either used for direct sequencing (deletion of the Notch1 promoter), or cloned 
into pCR2.1-TOPO vector (Invitrogen) (Notch1 exons 26, 27 and 34, and TCR-β
rearrangements). At least 10 individual minipreps were sequenced for each cloned
PCR product. For TCR-β, all analysed T-ALL samples had either one or two
rearrangements. Sample #89 yielded three PCR products (Extended Data Fig. 3a).
However, the two bigger PCR products corresponded to the same TCR sequence
which was amplified by two different Vβ primers (one of which, Vβ21, was cross
reactive). Hence, T-ALL #89 also had only two distinct TCR-β rearrangements.

Deletions in the Notch1 locus were analysed according to Ashworth et al.
by PCR on genomic DNA from T-ALL and from control DNA provided by T.
Ashworth and J. Aster. Primers and PCR conditions used were: ΔNotch1 F:
5'-ATGGTGGAATGCCTACTTTTGTA-3', R: 5'-CGTTTGGGTAGAAGAGATGCTTTAC-3',
PCR product: ~500 bp, annealing temperature 58 °C; gp130 F:
5'-GGCTTTTCCTCTGGTTCTTG-3', R: 5'-CAGGAACATTAGGCCAGATG-3',
PCR product: 615 bp, annealing temperature 60 °C. Mutations in exons 26, 27
and 34 of the Notch1 locus were analysed by PCR on genomic DNA as described
by O'Neil et al. using the following primers and PCR conditions: Notch1 exon 26 F:
5'-ACGGGAGGACCTAACCAAAC-3', R: 5'-CAGCTTGGTCTCCAACACCT-3',
PCR product: 792 bp, annealing temperature 60 °C; Notch1 exon 27 F:
5'-CGCTGAGTGCTAAACACTGG-3', R: 5'-GTTTTGCCTGCATGTACGTC-3',
PCR product: 505 bp, annealing temperature 60 °C; Notch1 exon 34.1 F:
5'-GCTCCCTCATGTACCTCCTG-3', R: 5'-TAGTGGCCCCATCATGCTAT-3',
PCR product: 904 bp, annealing temperature 62 °C; Notch1 exon 34.2 F:
5'-ATAGCATGATGGGGCCACTA-3', R: 5'-CTTCACCCTGACCAGGAAAA-3',
PCR product: 893 bp, annealing temperature 62 °C. TCR rearrangements were
analysed by PCR on genomic DNA using a pool of all Vβ (nomenclature according
to http://www.imgt.org) forward primers with either Jβ1.7 or Jβ2.7 reverse
primers. We used the following primers and PCR conditions:

Vβ primers: Vβ1: 5'-GCTGGAGCAAAACCCAAGGTG-3', Vβ2: 5'-CTGGTGGCAGTCACAGGGA-3',
Vβ3: 5'-GGGCAGATGGTGACCCTCAA-3', Vβ4: 5'-CCAAGGCGCTTCTCACCTCA-3',
Vβ5: 5'-GCCGCCAGAGCTCATGTTTC-3', Vβ12.1, Vβ12.2, Vβ12.3: 5'-CTGGGGTTGTCCAGTCTCCA-3',
Vβ13.1, Vβ13.2: 5'-GCTGCAGTCACCCAAAGCC-3', Vβ13.3: 5'-GCTGCAGTCACCCAAAGTC-3',
Vβ14: 5'-GCTGGAGTCACCCAGTCTC-3', Vβ15: 5'-GCTGGAGTCACCCAGTCTC-3',
Vβ16: 5'-GGCACAAGGTGACAGGGA-3', Vβ17: 5'-GCCTCTGCAGGCCTAGAGT-3',
Vβ19: 5'-GGTACCGACAGGATTCAGGG-3', Vβ20: 5'-CCCATCAGTCATCCCAACTTAT-3',
Vβ21: 5'-GGGGTTGTCCAGAATCCTAGACA-3', Vβ23: 5'-CCCCTCCAGCTCACTCTGCA-3',
Vβ24: 5'-GGGCACACTGCCTTTTACTGGT-3', Vβ26: 5'-CTGAGGCAGGAGACTCAGCA-3',
Vβ29: 5'-GGAATGTGGACAGGACATGAGC-3', Vβ30: 5'-CCAGTGTGTGGCTGACAGTCA-3';
Jβ primers: Jβ1.7: 5'-CCAAGACCATGGTCATCCAAC-3',
Jβ2.7: 5'-TGAGAGCTGTCTCCTACTATCGATT-3'. All Vβ (forward) primers were
pooled for the PCR reactions and each was used at a final concentration of 0.05 μM.
Reverse primers for either Jβ1.7 or Jβ2.7 were added to the reaction at a final concen-
tration of 0.5 μM. PCR cycling conditions were: 94 °C for 2 min, 35 cycles of 94 °C for
30 s, 58 °C for 30 s, 72 °C for 2 min followed by a final extension of 5 min at 72 °C.
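
Two small consistency checks on the primer and cycling information above can be sketched in Python. The first shows that the reverse primer of the Notch1 exon 34.1 amplicon is the reverse complement of the forward primer of exon 34.2, so the two amplicons meet at the same 20-nucleotide site. The second adds up the programmed cycling time, ignoring ramping between temperatures, which is an assumption. Variable and function names are illustrative.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return sequence.translate(COMPLEMENT)[::-1]


# Primer sequences as listed for Notch1 exon 34
exon34_1_reverse = "TAGTGGCCCCATCATGCTAT"
exon34_2_forward = "ATAGCATGATGGGGCCACTA"
primers_share_site = reverse_complement(exon34_2_forward) == exon34_1_reverse


def programmed_minutes(initial=2.0, cycles=35, denature=0.5,
                       anneal=0.5, extend=2.0, final_extension=5.0):
    """Total hold time of the cycling programme in minutes (no ramp times)."""
    return initial + cycles * (denature + anneal + extend) + final_extension


total_minutes = programmed_minutes()
```

With the stated conditions the programmed hold time is 112 min; actual run time on a thermocycler will be longer because of heating and cooling steps.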

Transcriptome analysis. For microarray experiments, at least 2 biological replicates
per group were used. This limited group size was considered adequate to globally 
characterize groups and to detect strongest regulated features as groups were rather 
homogeneous. Because heterogeneity was unknown for the test groups, larger 
numbers of samples were analysed. RNA was isolated using RNAzol B (IsoTex 
Diagnostics). All samples were subject to quality control. cRNA was synthesized
and hybridized to Illumina MouseWG-6 v2.0 BeadChips (at the Microarray Facility 
of the German Cancer Research Center). 

Data were normalized and analysed with Chipster v2.9. A principal component 
analysis was applied so that principal components 1 and 2 explain 80% of the 
variance of the data. 

The clustering of the samples within groups when displaying principal com-
ponent 1 versus 2 confirmed that the data could be grouped together for further
analysis. Samples that did not cluster with their experimental groups were kept
apart and not taken for subsequent analyses. This is the case of one thymus graft
(shown in Fig. 4c) that presented higher similarities to normal thymus than to the
remaining thymus grafts. Another case is one of the ‘old without competition’
samples, which does not cluster with any of the remaining samples. Pairwise com-
parisons were carried out between transcriptomes of experimentally defined 
groups. Resulting lists of differentially expressed genes were overlaid in Venn 
diagrams and changes that were either specific or common between comparisons 
were identified. Ingenuity Pathway Analysis software was used to identify cancer- 
associated genes. 
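
The overlay of differentially expressed gene lists in Venn diagrams amounts to set arithmetic. The Python sketch below is a minimal illustration under stated assumptions: the gene names and fold changes are invented, the fourfold cutoff (|log2 fold change| of at least 2) is taken from the legend of Extended Data Fig. 7, and the functions are not those of the Chipster software used for the analysis.

```python
def differentially_expressed(log2_fold_changes, min_abs_log2=2.0):
    """Genes whose expression changes at least fourfold in either direction."""
    return {gene for gene, lfc in log2_fold_changes.items()
            if abs(lfc) >= min_abs_log2}


def venn_regions(a, b, c):
    """Split three gene sets into the seven regions of a Venn diagram."""
    a, b, c = set(a), set(b), set(c)
    return {
        "a_only": a - b - c,
        "b_only": b - a - c,
        "c_only": c - a - b,
        "a_and_b_only": (a & b) - c,
        "a_and_c_only": (a & c) - b,
        "b_and_c_only": (b & c) - a,
        "all_three": a & b & c,
    }


# Hypothetical log2 fold changes for three pairwise comparisons
normal_vs_graft = {"g1": 2.5, "g2": -3.0, "g3": 0.4}
graft_vs_tall = {"g2": -2.2, "g4": 4.1}
normal_vs_tall = {"g1": 2.1, "g2": -5.0, "g4": 3.0, "g5": 1.9}

regions = venn_regions(
    differentially_expressed(normal_vs_graft),
    differentially_expressed(graft_vs_tall),
    differentially_expressed(normal_vs_tall),
)
```

Changes specific to one comparison fall into the `*_only` regions, and changes common to comparisons fall into the overlapping regions.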

Comparison of the murine T-ALL signature to the transcriptomes of human 
T-ALL and human B-ALL. All statistical analyses were carried out using R, ver- 
sion 3.0.0. Published human transcriptome data sets were retrieved from the Gene 
Expression Omnibus data repository (http://www.ncbi.nlm.nih.gov/geo/) for T-ALL 
(GSE26713) and B-ALL (GSE13351). The complete data was normalized and pre- 
processed as described in the original work (ref. 31), and sample clustering was 
analysed using the method of average-linkage hierarchical clustering with Euclidean 
distance. The human orthologues of the murine genes displayed in Fig. 4f were 
retrieved from Ensembl (Ensembl 74, NCBI37), and most murine genes (59 out of 
64) could be matched to the corresponding human genes. The mouse genes were 
mapped to 104 probe IDs in the mouse microarray data using R annotation pack- 
age (illuminaMousev2.db, version 1.20.0), and the corresponding human orthologues 
were then mapped to 127 probe IDs using R annotation package (hgu133plus2.db, 
version 2.10.1). To perform correlation analysis, we computed a centroid for each
subtype of disease in two steps. First, probes were combined to genes based on Ensembl
gene IDs, and each gene was represented by the intensity value of the probe with
the highest variance across all samples in the data set (WGCNA R library, version
1.34). Second, the first principal component of the gene expression for each gene in
each subtype was calculated. The Spearman rank correlation test was performed
using the R library stats (version 3.0.0) and permutation P values were calculated 
using Spearman test of independence based on Monte-Carlo resampling (coin R 
library, version 1.0-23). 
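
Parts of this correlation analysis can be sketched in plain Python. The code below is an assumption-laden illustration, not a reimplementation of the WGCNA, stats or coin routines cited above: it picks the highest-variance probe per gene, computes a Spearman correlation from average ranks, and estimates a two-sided permutation P value by Monte Carlo resampling. The centroid step based on the first principal component is not covered, and all names are hypothetical.

```python
import math
import random
from statistics import mean, pvariance


def pick_probe(probes: dict) -> str:
    """Return the ID of the probe with the highest variance across samples."""
    return max(probes, key=lambda probe_id: pvariance(probes[probe_id]))


def ranks(values):
    """1-based ranks; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def pearson(x, y):
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # correlation is undefined for constant input; report 0
    return sxy / math.sqrt(sxx * syy)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def permutation_p(x, y, n_perm=2000, seed=0):
    """Two-sided Monte Carlo permutation P value for the Spearman correlation."""
    rng = random.Random(seed)
    observed = abs(spearman(x, y))
    shuffled = list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if abs(spearman(x, shuffled)) >= observed - 1e-12:
            hits += 1
    return (hits + 1) / (n_perm + 1)
```

A call such as `spearman(mouse_values, human_centroid)` followed by `permutation_p(mouse_values, human_centroid)` would mirror the rho and P[perm] columns of Extended Data Fig. 8, given suitably matched orthologue vectors (both argument names are placeholders).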
Extended Data Figure 1 | Characterization of T-ALL. a, Splenocytes from
Rag2−/−γc−/−KitW/Wv recipient mice with T-ALL were gated for CD45.1+
thymus donor cells, and analysed for CD4 and CD8 expression. All T-ALL
samples were analysed in this manner. b, RAG2p-GFP reporter thymus-grafted
(top), and non-transplanted (bottom) kidneys of a T-ALL+ recipient. Bright
field and fluorescent images superimposed in the top panel (scale bars, 5 mm).
All T-ALL+ mice had thymic tumours, and all RAG2p-GFP reporter thymus-
derived T-ALL expressed Rag2 based on in situ fluorescence (shown here), or
based on flow cytometry (Extended Data Fig. 5). c, Volume of spleens by
magnetic resonance imaging (MRI). d, Spleen cellularity. e, Ki67 expression in
CD4+ and CD8+ control splenocytes (left panel), and in T-ALL samples;
each colour is an individual tumour (right panel). Control splenocytes or
T-ALL cells unstained for Ki67 are shown in the grey shaded curves.
f, Immunostaining of spleen cryosections of the indicated mice (n = 2 for
T-ALL, and n = 1 for the controls) stained for T cells (CD4 plus CD8 in red), B
cells (B220 in green), and erythrocytes (Ter119 in blue). Scale bar, 200 μm.
g, Liver cryosections of the indicated samples (n = 4 for T-ALL, and n = 1 for
the control) were stained with haematoxylin and eosin. Mice with T-ALL had
perivascular and periportal infiltrates of a monomorphous small neoplasm that
infiltrated into the adjacent liver parenchyma. At higher magnification (not
shown), the neoplastic cells appeared medium-sized to large and showed a
pronounced nuclear pleomorphism indicated by irregular edged nuclear
contours with coarsely plumped chromatin. Cell borders were distinct, and a
cytoplasmic rim was nearly invisible. These histomorphological findings were
consistent with liver infiltration of a malignant haematopoietic neoplasm. Scale
bar, 400 μm. h, Liver cryosections of the indicated samples (n = 2 for T-ALL,
and n = 1 for the control) were immunostained for CD8 (red) and counter-
stained for DAPI (blue). Scale bars, 50 μm. i, MRI analysis of T-ALL.
Splenocytes from T-ALL+ mice were injected intravenously into athymic nude
(Foxn1nu/nu) mice. T-ALL+ nude mice were killed and immediately scanned by
MRI (n = 10). Images shown are from a representative T-ALL+ mouse in two
levels, revealing enlarged kidneys, liver, spleen and lymph nodes reflecting
T-ALL dissemination (upper panels). Images from a nude mouse not receiving
leukaemic cells (n = 2) are shown as controls (lower panels).
a [Flow cytometry plots of spleen cells from B6/Ly5.1, nude, and nude + T-ALL mice: CD4 versus CD8 (top row) and CD8 versus RAG2p-GFP (bottom row).]

b
1° T-ALL   Cells transferred   2° recipients   T-ALL pos   Time (d)
#15        10°                 4               3           15-23
#15        10°                 4               4           24
#15        10°                 4               4           24
#31        10°                 2               0           126
#34        10°                 5               5           18-22
#35        10°                 2               2           35&45
#47        10°                 4               4           27&33
#51        10°                 2               0           122
#53        10°                 4               0           105
#55        10°                 3               3           33
#68        10°                 2               2           22
#69        10°                 2               2           44
#74        10°                 2               2           50
#78        10°                 2               2           27
#81        10°                 2               2           44&50
#82        10°                 2               1           27
#88        10°                 2               2           4
#116       10°                 2               2           37
#332       10°                 3               3           35
#341       10°                 2               2           28
#342       10°                 2               0           37
#347       10°                 2               3           21

Average day of onset = 32 days


Extended Data Figure 2 | Transplantation of T-ALL into nude (Foxn1nu/nu)
mice. a, Analyses of CD4 and CD8 expression (top row), and CD8 and the
RAG2p-GFP reporter (bottom row) in the spleens of a normal B6/Ly5.1 mouse
(left), an unmanipulated nude mouse (middle), and a nude mouse 3 weeks after
receiving 10° splenocytes from a T-ALL+ Rag2−/−γc−/−KitW/Wv recipient
(right). Data are representative for recipients listed in b. b, Summary of T-ALL
sample numbers (primary (1°) T-ALL), numbers of cells transferred per nude
mouse (cells transferred), numbers of nude mice injected per T-ALL sample
(secondary (2°) recipients), numbers of nude mice developing T-ALL (T-ALL
positive), and time in days until onset of the disease, or end of observation
period (time).
b
T-ALL sample   Vβ – Jβ
#31            Vβ13-1 Jβ2-7
#34            Vβ19 Jβ1-6
#35 (1)        Vβ15 Jβ2-3
#35 (2)        Vβ13-2 Jβ2-3
#47            Vβ23 Jβ1-1
#51            Vβ13-1 Jβ2-5
#55 (1)        Vβ23 Jβ1-1
#55 (2)        Vβ13-2 Jβ2-7
#65            Vβ3 Jβ2-4
#68            Vβ2 Jβ2-7
#69            Vβ19 Jβ1-2
#70            Vβ13.3 Jβ2-7
#71            Vβ3 Jβ2-5
#74 (1)        Vβ13-3 Jβ2-5
#74 (2)        Vβ13-2 Jβ2-7
#78 (1)        Vβ2 Jβ2-4
#78 (2)        Vβ13-2 Jβ2-5
#81 (1)        Vβ4 Jβ2-4
#81 (2)        Vβ13.3 Jβ2-7
#82            Vβ13.2 Jβ2-2
#86            Vβ3 Jβ2-5
#88 (1)        Vβ19 Jβ1-3
#88 (2)        Vβ3 Jβ2-4
#89 (1)        Vβ12-1 Jβ2-3
#89 (2)        Vβ5 Jβ2-5
#106 (1)       Vβ13.1 Jβ2-1
#106 (2)       Vβ15 Jβ2-1
#116           Vβ1 Jβ1-5


Extended Data Figure 3 | TCR-β rearrangements in T-ALL. a, PCR analyses
of Vβ to Jβ1 (top), and Vβ to Jβ2 (bottom) rearrangements on genomic DNA
from T-ALL+ spleen samples. TCR-β rearrangements were identified by
PCR using a pool of Vβ-gene-specific forward primers and reverse primers
to either Jβ1.7 (top) or Jβ2.7 (bottom). Dominant, T-ALL-derived TCR-β
rearrangements were visualized by gel electrophoresis, and PCR products were
cloned for DNA sequencing. b, T-ALL samples analysed (sample ID), identified
Vβ and Jβ elements (Vβ – Jβ), and classification of in-frame, or out-of-frame
rearrangements based on the reading frames of the obtained sequences
(productive/non-productive).
Extended Data Figure 4 | Experiments addressing prevention of T-ALL by
reconstitution with functional wild-type bone marrow. a, Experimental
design: Rag2−/−γc−/−KitW/Wv mice were transplanted with wild-type thymus
(as described in Fig. 1a), and at 1, 6 or 10 weeks after thymus grafting were 
injected intravenously with 3 X 10° lineage-negative wild-type bone marrow 
cells (BM). T-ALL was prevented in mice receiving bone marrow 1 week after 
thymus transplantation (n shown in Fig. 2b). b, Flow cytometric analysis of 
bone marrow stem/progenitors demonstrating engraftment of donor ‘rescue’ 
bone marrow (left panels), and the cellular phenotypes in the thymus grafts 
(middle and right panels) (n shown in Fig. 2b). Bone marrow cells are pre-gated
as lineage marker-negative (lin−) and donor cells (CD45.1+). Thymocytes are
pre-gated for cells of graft origin (CD45.2+CD45.1+, middle panels), and cells
of bone marrow ‘rescue’ origin (CD45.1+, right). Numbers in the plots refer to
the percentage of gated cells. For each condition (bone marrow reconstitution 
at 6 or 10 weeks post thymus transplantation), examples are shown for ongoing 
T-cell development from the wild-type bone marrow, but not the thymus- 
resident cells, with no evidence for T-ALL (first and third rows), and for cases in 
which T-ALL developed while T-cell development of wild-type bone marrow 
origin did not prevail (second and fourth rows). The bone marrow data indicate 
that prevention of T-ALL was associated with stronger HSC engraftment by 
wild-type donor cells; however, it cannot be excluded that the overt T-ALL
already suppressed haematopoiesis in the bone marrow, and hence the 
difference in the bone marrow may be secondary to the emergence of T-ALL. 
Extended Data Figure 5 | T-ALL emergence. a, Peripheral blood cells
(gated for CD45.1+) from a normal control mouse (B6/Ly5.1), and a
Rag2−/−γc−/−KitW/Wv recipient carrying a RAG2p-GFP (CD45.1+) donor
thymus were analysed by flow cytometry at the indicated time points.
GFP-positive cells are shown in green. b, The recipient shown in a was killed on
week 18, and the indicated organs were analysed. RAG2p-GFP+-gated cells
are green. Data are representative for RAG2p-GFP+ thymus-derived
T-ALL (n = 12).
a
T-ALL    Coding DNA sequence                            Mutation                   Prediction on protein level
sample   description                                    type    Exon   Domain   Effect       Description
162      c.4793A>C                                      sub     26     HD-N     Missense     p.H1598P
162      c.5003T>C                                      sub     27     HD-C     Missense     p.L1668P
31       c.5003T>C                                      sub     27     HD-C     Missense     p.L1668P
86       c.5003T>C                                      sub     27     HD-C     Missense     p.L1668P
81       c.6864_6865insGGTGCACCGGCAAGCTTGAATGGCT        ins     34     TAD      ins, eSTOP   p.G2296_Q2297insWCTGKLEWPV*
70       c.7038dupC                                     dup     34     TAD      fr, eSTOP    p.N2347Qfs*144
170      c.7043_7044delinsGGGGGT                        indel   34     TAD      fr, eSTOP    p.T2348Rfs*143
34       c.7052dupC                                     dup     34     TAD      fr, eSTOP    p.I2352Dfs*139
65       c.7082_7083dupGG                               dup     34     TAD      fr, eSTOP    p.L2362Gfs*35
55       c.7082delinsAACCACCC                           indel   34     TAD      fr, eSTOP    p.R2361Qfs*132
67       c.7082delinsC¢CCCC                             indel   34     TAD      fr, eSTOP    p.R2361Pfs*130
69       c.7082delinsCC                                 indel   34     TAD      fr, eSTOP    p.R2361Pfs*129
71       c.7083delinsCCTC                               indel   34     TAD      L ins        p.R2361_L2362insL
88       c.7094dupA                                     dup     34     PEST     fr, eSTOP    p.P2366Afs*125
86       c.7111_7114dupACCC                             dup     34     PEST     fr, eSTOP    p.Q2372Hfs*119
164      c.7171_7181dupCAGCCACACCT                      dup     34     PEST     fr, eSTOP    p.S2396Hfs*4
31       c.7183_7184insGG                               ins     34     PEST     fr, eSTOP    p.S2395RV*
78       c.7193C>A                                      sub     34     PEST     Nonsense     p.S2398*
35       c.7193_7194insAA                               ins     34     PEST     fr, eSTOP    p.A2398Rfs*11
53       c.7193_7194insTG                               ins     34     PEST     fr, eSTOP    p.A2398Gfs*11
51       c.7193_7194insACAA                             ins     34     PEST     fr, eSTOP    p.A2398Qfs*94
47       c.7258insT                                     ins     34     PEST     fr, eSTOP    p.P2420Sfs*72
82       c.7260_7261insACTC                             ins     34     PEST     fr, eSTOP    p.L2421Tfs*71
162      c.7277_7278insGAGTGTGGGG                       ins     34     PEST     fr, eSTOP    p.P2427Sfs*67
74       c.7384C>T                                      sub     34     PEST     Nonsense     p.Q2462*
15       c.7470delinsGG                                 indel   34     PEST     fr, eSTOP    p.E2491GVP*
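
The relation between the coding-DNA positions and the protein-level descriptions in panel a can be checked with a little arithmetic. The Python sketch below is an illustrative aid with hypothetical function names: it maps a coding position to its codon number and tests whether a net change in length preserves the reading frame. It does not translate sequences or predict stop codons.

```python
def codon_number(cdna_position: int) -> int:
    """Codon (amino-acid) number containing a 1-based coding DNA position."""
    return (cdna_position + 2) // 3


def position_in_codon(cdna_position: int) -> int:
    """Position (1, 2 or 3) of the nucleotide within its codon."""
    return (cdna_position - 1) % 3 + 1


def preserves_reading_frame(deleted: int, inserted: int) -> bool:
    """True if the net length change is a multiple of three nucleotides."""
    return (inserted - deleted) % 3 == 0


# Substitutions from panel a: c.4793A>C (p.H1598P), c.5003T>C (p.L1668P),
# c.7193C>A (p.S2398*), c.7384C>T (p.Q2462*)
codons = [codon_number(p) for p in (4793, 5003, 7193, 7384)]

# c.7038dupC adds one nucleotide; c.7083delinsCCTC replaces one with four
dup_in_frame = preserves_reading_frame(deleted=0, inserted=1)
delins_in_frame = preserves_reading_frame(deleted=1, inserted=4)
```

The four substitutions fall in codons 1598, 1668, 2398 and 2462, matching the listed protein changes; the single-nucleotide duplication shifts the frame, whereas the net gain of three nucleotides in c.7083delinsCCTC is consistent with the in-frame leucine insertion.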
b [Schematic of the Notch1 locus with the 5' deletion (ΔNotch1) and PCR gels; deletions in 13/25 Rag1+/+ and 0/5 Rag1−/− T-ALL.]
c GL GCAGTGCGGGCTGCTAGGAGGGAGGAGGATGCCCCACCTCA(12bp) ACATGAAGG CTTTATAGC(23bp) CACTGTG TAGGCTTGGGTGATTGAGGG
[Junction sequences of individual deletions aligned below the germline (GL) sequence.]
d [RT-PCR gels: normal thymus (t) and spleen (s), and T-ALL samples.]


Extended Data Figure 6 | Mutations in and aberrant expression of Notch1
in T-ALL. a, Exons 26, 27 and 34 of Notch1 were sequenced and the mutation
position, mutation type and predicted consequence for the protein are listed
for all analysed T-ALL. All individual mutations were heterozygous and
correspond to the triangles depicted in Fig. 3e. Substitutions (sub), insertions
(ins), duplications (dup), complex deletions and insertions (indel), predicted
frameshift (fr) and early STOP codons (eSTOP) are indicated, as well
as the protein domains predicted to be affected: amino or carboxyl
heterodimerization domains (HD-N or HD-C, respectively), transactivation
domain (TAD), and PEST domain. b, Schematic view of the Notch1 locus,
depicting 5' deletions (ΔNotch1), and positions of the annealing sites for the
primers used to detect deletions (red triangles). Exons are indicated by black
bars. Genomic DNA from Rag1+/+ (n = 25) and Rag1−/− (n = 5) T-ALL was
analysed by PCR for the presence of the deletion. Amplification of the band
with ~500 bp (ΔNotch1; upper gel) identifies the deletion. The non-deleted
locus is too large to yield a band; hence, absence of the band indicates no
deletions. Glycoprotein 130 (gp130) was amplified as positive control for the
presence of the DNA template. Deletions were seen in 13/25 Rag1+/+, but in
0/5 Rag1−/− T-ALL. c, PCR products were sequenced to characterize the
junctions. The germline Notch1 sequence is shown on the top (GL), revealing
the sequence homology to the recombination signal sequence (RSS) highlighted
in bold. The conserved residues in the putative cryptic RSS in the 5' region of
the Notch1 locus are shown in red. For 10 deletions, the sequences flanking
the junctions are shown, with evidence of N nucleotide addition. The positive
control DNA was derived from a T-ALL cell line that contained the deletion
(provided by Ashworth and Aster). d, cDNAs from spleen (s) and thymus (t)
from normal mice, and from T-ALL+ spleens were analysed for expression of
canonical Notch1 promoter transcripts (E1E6), alternative Notch1 promoter
transcripts (E1aE6), Hes1, and β-actin.
Extended Data Figure 7 | Venn diagrams. Diagrams underlying the
transcriptome analysis in Fig. 4. The transcriptomes from normal thymi,
thymus grafts (10 weeks after transplantation) and T-ALL samples were
analysed by pairwise comparisons: normal thymus versus thymus grafts (top
diagram); thymus grafts versus T-ALL (lower right diagram); normal thymus
versus T-ALL (lower left diagram). The Venn diagram shows numbers of genes
with expression changes of ≥ fourfold for each comparison. The overlapping
regions show the genes that were common to the different comparisons. The
numbers 60, 22 and 64 (in bold) correspond to the genes displayed in the heat
maps d, e, and f, respectively, in Fig. 4.
Extended Data Figure 8 | Molecular correlation between progenitor
deprivation-driven murine T-ALL and human T-ALL. a, b, Genes
upregulated in murine T-ALL (list from Fig. 4f) were compared to the
transcriptome of human T-ALL (a), and human B-ALL (b). Upregulated
changes are significantly and positively correlated with the expression of the
orthologous genes in all human T-ALL subtypes (GSE26713, 117 patient
samples) (a). Upregulated changes do not correlate with the expression of
the orthologous genes in human B-ALL subtypes (GSE13351) (b). The 73
patient samples in GSE13351 also included 15 T-ALL samples that correlated
with the mouse T-ALL samples (bottom row). c, Intensity values for the
indicated genes were obtained from the normalized transcriptome data.


a  Late changes, upregulated genes

Subtype           rho     P      P[perm]
HOXA              0.34    0.02   0.02
Immature          0.32    0.03   0.02
MYB               0.37    0.01   0.01
Proliferative     0.36    0.01   0.02
TAL/LMO           0.34    0.02   0.01
TLX               0.34    0.02   0.02

b  Late changes, upregulated genes

Subtype           rho     P      P[perm]
Hyperdiploid      0.05    0.72   0.72
BCR-ABL          -0.02    0.90   0.89
E2A-rearranged    0.07    0.63   0.63
MLL               0.08    0.59   0.60
TEL-AML1          0.11    0.47   0.45
other             0.02    0.87   0.89
T-ALL             0.32    0.03   0.03

c  [Plots of normalized intensity values for four genes; panel titles are only partly legible (Lmo2 among them).]


Mean and standard deviation are displayed. 


[Extended Data Figure 9 image. Panel a: chimaerism (%) per mouse within LSK, TN2 and TN3 populations for donor input ratios 100:0, 50:50, 25:75, 5:95, 1:99 and 0:100. Panel b: cell numbers for the same input ratios.]


Extended Data Figure 9 | Sensitivity and stage of competition between wild-
type and IL-7-unresponsive (γc−/−) T-cell progenitors. a, A total of 3 X 10°
lin− donor bone marrow cells were either pure (100%) wild-type (γc+/+)
(CD45.1+; black bars) cells, or pure (100%) γc−/− (CD45.2+; red bars) cells, or
mixtures of both genotypes (ratios indicated at the top), and used to
reconstitute 1,100 rad-irradiated recipients (CD45.1+CD45.2+). Each bar
corresponds to a single mouse (no. 1-18). Percentages of chimaerism within
bone marrow lin− Sca-1+ Kit+ (LSK) (that contain haematopoietic stem and
early progenitor cells) (top), within thymic TN2 (middle), and TN3 (bottom)
are displayed. Mice were analysed 4 weeks after bone marrow reconstitution. In
the recipient bone marrow, the ratio of γc+/+ versus γc−/− HSC donor
engraftment closely mirrored the proportions of the input chimaerism. At the
TN2 stage, few γc−/− T-cell progenitors were still detectable, but by the TN3
stage they were completely outcompeted by wild-type donor, or residual
host progenitors. b, Cell numbers of TN2 (top) and TN3 (bottom) of wild type
(left panels, black bars) or γc−/− (right panels, red bars) are displayed as
averaged cell counts for each group of mice shown in a. Note that the scales
of the y axis differ between the left and right panels.
Extended Data Figure 10 | Comparison of young versus old progenitors in
the presence, or absence of competition: transcriptome, Bcl2 mRNA and
protein expression, and apoptosis. a, Venn diagram representing the overlap
between pairwise comparisons of gene expression (threshold = twofold)
between young and old with competition (green circle), and between young
and old without competition (orange circle). To correct for potential
developmental differences in gene expression at the TN2 to TN3 transition, we
sorted TN2 and TN3 cells from a normal thymus. The numbers 9, 26 (in Fig. 5
only 17/26 genes are displayed), and 19 (all in bold) correspond to the genes
displayed in the heat maps in Fig. 5j, k and l, respectively. b, Bcl2 mRNA levels
as determined by fluorescence intensity in the expression array from sorted
young and old TN2/TN3 cells. c, Bcl2 protein levels, measured by intracellular
FACS staining after gating on young and old TN3 cells, are depicted as
mean fluorescence intensity (MFI). d, Percentage of annexin+ cells in TN3.
A Wolf-Rayet-like progenitor of SN 2013cu from
spectral observations of a stellar wind


Avishay Gal-Yam, I. Arcavi, E. O. Ofek, S. Ben-Ami, S. B. Cenko, M. M. Kasliwal, Y. Cao, O. Yaron, D. Tal, J. M. Silverman,
A. Horesh, A. De Cia, F. Taddia, J. Sollerman, D. Perley, P. M. Vreeswijk, S. R. Kulkarni, P. E. Nugent, A. V. Filippenko
& J. C. Wheeler

The explosive fate of massive Wolf-Rayet stars¹ (WRSs) is a key
open question in stellar physics. An appealing option is that hydro- 
gen-deficient WRSs are the progenitors of some hydrogen-poor 
supernova explosions of types IIb, Ib and Ic (ref. 2). A blue object, 
having luminosity and colours consistent with those of some WRSs, 
has recently been identified in pre-explosion images at the location 
of a supernova of type Ib (ref. 3), but has not yet been conclusively 
determined to have been the progenitor. Similar work has so far 
only resulted in non-detections⁴. Comparison of early photometric
observations of type Ic supernovae with theoretical models suggests
that the progenitor stars had radii of less than 10¹² centimetres, as
expected for some WRSs⁵. The signature of WRSs, their emission
line spectra, cannot be probed by such studies. Here we report the 
detection of strong emission lines in a spectrum of type IIb super- 
nova 2013cu (iPTF13ast) obtained approximately 15.5 hours after 
explosion (by ‘flash spectroscopy’, which captures the effects of the 
supernova explosion shock breakout flash on material surrounding 
the progenitor star). We identify Wolf-Rayet-like wind signatures, 
suggesting a progenitor of the WN(h) subclass (those WRSs with 
winds dominated by helium and nitrogen, with traces of hydrogen). 
The extent of this dense wind may indicate increased mass loss from 
the progenitor shortly before its explosion, consistent with recent 
theoretical predictions⁶.

Wolf-Rayet stars are massive stars stripped of their outer, hydrogen- 
rich envelope. These stars blow strong, hydrogen-poor winds. The inner 
part of the wind engulfing the star is dense and optically thick, and 
efficiently absorbs the ionizing continuum from the hot stellar surface. 
Farther from the star, the density drops and the wind becomes optically 
thin in the continuum, leading to a rich emission spectrum of recom-
bination lines. Detailed models of such spectra can be constructed⁷,⁸ and
depend essentially on only three parameters: the effective temperature,
T_eff, in the line-forming region, a normalized radius, R_t (a combination
of the stellar radius, luminosity, mass loss rate and wind terminal
velocity’), and the chemical composition, Z, of the wind (assumed to
be uniform, spherical and of a constant mass loss rate). The composition
of the wind determines Wolf-Rayet spectral classes: stars with dominant
He and N lines belong to the WN class (with those also showing traces
of H usually denoted as WN(h)), stars with strong carbon lines belong
to the WC class and rare (and possibly hotter) stars with oxygen-rich
spectra belong to the WO class’.

Shortly after a WRS explodes as a supernova, the outer parts of the
wind (which in some cases¹ extend to radii of >10¹⁴ cm) that have not
yet been swept up by the expanding supernova ejecta will emit strong 
recombination lines in response to ionizing flux released by the explosion 
shock breakout from the stellar surface. We estimate an increase in ion- 
izing luminosity by a factor of order 107-10*, assuming an initial absolute 
magnitude range of —2.5 mag < M < —10 mag for the exploding WRS' 


and a typical early-time luminosity of M = −12.5 mag for the resulting
supernova’**. The effective temperature, T_eff, will also change, being
very high (>10°K) shortly after explosion? and decreasing as the
shocked supernova ejecta cool. The radiation illuminating the surviv-
ing Wolf-Rayet wind will thus effectively vary monotonically through
the range of temperatures in WRS line-forming regions. Because the 
wind parameters (composition, mass loss rate and terminal velocity) 
do not change, the measured line spectrum observed shortly after explo- 
sion should be similar to that of a WRS with the spectral class of the 
exploding progenitor (because the spectral classes mainly reflect the 
wind composition). The high wind densities around WRSs (with electron
densities of n_e = 10¹¹-10¹⁴ cm⁻³) imply short recombination times",
t_rec ≈ 3.9 × 10¹² (n_e/cm⁻³)⁻¹ (T/10⁴ K)⁰·⁸ s, typically of a few minutes
for Wolf-Rayet densities and temperatures, and so the emitted spectrum
will promptly react to the rapidly evolving supernova radiation field.
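
As a rough numerical illustration of the recombination-time scaling quoted above, the short Python sketch below evaluates it for a few densities and temperatures. The exponent 0.8 on the temperature term is an assumed reading of the scaling (it matches the usual case-B hydrogen recombination behaviour), and the sample values of n_e and T are chosen for illustration only; none of them are values reported by the authors.

```python
def recombination_time_s(n_e_cm3: float, temperature_k: float) -> float:
    """Approximate recombination time in seconds.

    Uses t_rec ~ 3.9e12 * (n_e / cm^-3)^-1 * (T / 1e4 K)^0.8 s.
    The 0.8 exponent is an assumption about the scaling in the text.
    """
    return 3.9e12 / n_e_cm3 * (temperature_k / 1.0e4) ** 0.8


if __name__ == "__main__":
    # Illustrative (assumed) electron densities and temperatures.
    for n_e in (1.0e11, 1.0e12):
        for temp in (1.0e4, 5.0e4, 1.0e5):
            t_rec = recombination_time_s(n_e, temp)
            print(f"n_e = {n_e:.0e} cm^-3, T = {temp:.0e} K -> t_rec = {t_rec:8.1f} s")
```

Even at the lowest density assumed here, the recombination time stays at the level of minutes, far shorter than the hours that separate the explosion from the first spectrum.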
We obtained rapid spectroscopic observations of the recent type IIb 
supernova SN 2013cu (iPTF13ast) shortly after shock breakout (by 
flash spectroscopy; see Methods). This event was first detected by 
the Intermediate Palomar Transient Factory (iPTF) survey¹¹ on 2013
May 3.18 (we express dates in coordinated universal time (UTC)), photo-
metrically confirmed 5.8 h later and promptly identified by an on-duty
astronomer who triggered rapid follow-up observations”, including an
optical spectrum obtained just 4 h later. Analysis of the early-time light
curve of this supernova (Extended Data Fig. 1) suggests that it exploded
on May 2.93, implying that the first iPTF detection and the first spec-
trum correspond to only 5.7 h and 15.5 h after the explosion, respect-
ively. A full description of the supernova and its evolution will be
reported in a forthcoming publication (A.G.-Y. et al., manuscript in
preparation). We note that this event was independently observed by
the MASTER survey on May 5.3 (~2.3 d after explosion), and it was
assigned the name SN 2013cu following spectroscopic confirmation”.
Our first spectrum of SN 2013cu reveals a continuum and emission 
lines that bear a striking resemblance to spectra of WRSs (Fig. 1a). 
According to accepted Wolf-Rayet terminology’, the spectrum is clas-
sified as subclass WN6(h) (Fig. 1a, blue trace); the relative strength of
nitrogen to carbon lines precludes a WC classification, and the absence
of any high-excitation oxygen lines is inconsistent with a WO star. The
stronger lines (Hα, Hβ, N IV λ7,115 (7,115 Å wavelength) and He II
λ5,411) exhibit a complex profile (Fig. 2) consisting of a relatively broad
base (~2,500 km s⁻¹ full width at zero intensity (FWZI)) on which
prominent narrow, unresolved lines (FWZI ~ 3 Å; velocity dispersion,
<150 km s⁻¹) are superimposed. This is consistent with predictions
for Wolf-Rayet pre-supernova wind velocities'*”*, although we cannot
exclude the possibility that at least some of the observed line broadening 
is produced by electron scattering rather than genuine velocity disper- 
sion. To the best of our knowledge, no similar spectra of a stripped 
(H-poor) supernova have been acquired previously. Wolf-Rayet-like 
spectroscopic features have been observed in spectra of some H-rich
(non-stripped) supernovae obtained at substantially later epochs, and 
their typically much broader lines were interpreted as emerging from 
interaction with circumstellar material’® (CSM). We further discuss these 
previous observations in Extended Data Fig. 3. 

We analyse our very early spectrum using the PoWR grid of Wolf- 
Rayet spectral models’ (http://www.astro.physik.uni-potsdam.de/~wrh/ 
PoWR/powrgrid1.html). We find an excellent fit with WN(h) models 
calculated assuming an H fraction of 20% (by mass) and a temperature of 
Figure 1 | Spectroscopy of the type IIb supernova SN 2013cu reveals
transient Wolf-Rayet-like features. a, The early spectrum of SN 2013cu
(black; specific flux, F_λ, versus wavelength) is compared with models of
WNL-class WRSs’ (red and grey), showing remarkable similarity in continuum
shape and in line features (strong He, N and Balmer lines indicate a WN6(h)
classification; compare with HD 192163 (blue)). b, Emission line evolution
during the first week. The first spectrum (red) is compared with later spectra.
By day 3 (blue and cyan), the initially strong Wolf-Rayet features disappear, but
the Hα line remains constant at 3.4 × 10⁻¹⁵ erg s⁻¹ cm⁻². The spectrum on
day 6 (magenta) is almost featureless, except for weak Hα emission (inset) with
an intensity less than a tenth that of day 3, probably because the line-forming
region has been cleared by the expanding ejecta. c, SN 2013cu is of type IIb.
A spectrum of SN 2013cu 69 days after explosion (black) is compared with
the prototypical type IIb supernovae SN 1993J (60 days; red) and SN 2011dh
(43 days; green), and with the typical non-stripped type II-P supernova
SN 2004et (45 days; blue). SN 2013cu, SN 2011dh and SN 1993J exhibit strong
blue-shifted He I absorption at 5,876, 6,678, 7,065 and 7,281 Å (marked with
black vertical lines), which are not detected in type II-P supernovae. See
Methods for more details.


around 50,000 K. This temperature is consistent with the lower limit 
obtained from early ultraviolet photometry from NASA’s Swift space- 
craft (Extended Data Fig. 1). The essentially perfect match of the ob- 
served and modelled continuum shapes indicates that dust reddening 
must be negligible; any pre-existing dust must have been destroyed by 
the supernova explosion flash (Methods). We note that among the large 
catalogue of Galactic WN stars specifically modelled in this manner’, 
no stars drive winds that require more than 56% H by mass. Presum- 
ably, custom spectral fits’ could be calculated and used to determine 
more accurately the physical parameters of the detected Wolf-Rayet 
wind. Assuming that the spectrum was obtained 15.5 h after explosion
and that a standard ejecta velocity of 10⁴ km s⁻¹ is applicable, the narrow-
line-emitting material must be located at radii greater than ~5 × 10¹³ cm
for it not to have been swept up and accelerated by the expanding ejecta.
This lower limit is consistent with the extent of some Wolf-Rayet
winds, where the line-formation region extends out to several (five to
ten) times’” the hydrostatic radius. Recent pre-supernova Wolf-Rayet
models’ suggest hydrostatic radii of 10-20 solar radii for WN super-
nova progenitors, consistent with line-formation regions extending to
several hundred solar radii¹, or >10¹³ cm.
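
The lower limit on the radius is simply the distance that freely expanding ejecta cover in the time since explosion. A minimal sketch of that arithmetic follows, using the 15.5 h epoch and the 10⁴ km s⁻¹ ejecta velocity from the text; the nominal solar radius (6.957 × 10¹⁰ cm) is a standard constant supplied here for the unit conversion, not a value from the paper.

```python
KM_TO_CM = 1.0e5          # centimetres per kilometre
HOUR_S = 3600.0           # seconds per hour
R_SUN_CM = 6.957e10       # nominal solar radius in cm (assumed constant)


def swept_radius_cm(v_ejecta_km_s: float, elapsed_s: float) -> float:
    """Radius reached by ejecta moving at constant velocity for elapsed_s seconds."""
    return v_ejecta_km_s * KM_TO_CM * elapsed_s


# Ejecta at 1e4 km/s, 15.5 h after explosion.
r_min_cm = swept_radius_cm(1.0e4, 15.5 * HOUR_S)
print(f"r_min = {r_min_cm:.2e} cm = {r_min_cm / R_SUN_CM:.0f} solar radii")
```

The result is several hundred solar radii, the same order as the line-formation regions discussed above.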

We further constrain the physical location of the wind using the fol-
lowing calculation. We measure the Hα line flux from our spectrum
(calibrated to our host-subtracted photometry) and obtain F_Hα = 3.4 ×
10⁻¹⁵ erg s⁻¹ cm⁻². We translate this to line luminosity using a lumin-
osity distance to the host galaxy, UGC 9379, of d = 108 Mpc (calculated
for a flat Λ cold dark matter cosmology with Hubble constant H₀ =
73 km s⁻¹ Mpc⁻¹, a matter density of Ω_m = 0.27 and a redshift z =
0.025734, obtained from the NASA Extragalactic Database (NED), as
well as negligible extinction), obtaining L_Hα = 4.8 × 10³⁹ erg s⁻¹. We
can then estimate’® the pre-explosion H mass loss rate

\[
\dot{M} = 0.01 \left(\frac{L_{\mathrm{H}\alpha}}{2\times10^{39}\ \mathrm{erg\ s^{-1}}}\right)^{1/2} \left(\frac{v_{\mathrm{w}}}{500\ \mathrm{km\ s^{-1}}}\right) \left(\frac{r}{10^{15}\ \mathrm{cm}}\right)^{1/2} M_{\odot}\ \mathrm{yr^{-1}},
\]

assuming that the lines are formed at a radius r by recombination, that
the width of the emitting shell is similar to its radius, that the explosion
is spherically symmetric and that the density of the wind decreases as r⁻².
We assume a wind velocity of v_w = 2,500 km s⁻¹, consistent with our
spectra and as expected for WRSs, but our results are not sensitive to
this value and remain essentially unchanged for velocities in the range
100 < v_w < 2,500 km s⁻¹.
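
The conversion from line flux to line luminosity can be checked with a few lines of Python. The sketch below integrates the flat ΛCDM distance for the quoted H₀, Ω_m and z with a simple midpoint rule, then applies L = 4πd²F. The flux value of 3.4 × 10⁻¹⁵ erg s⁻¹ cm⁻² is an assumption (the exponent is chosen so that the result is of order 10³⁹ erg s⁻¹), and the function names are illustrative.

```python
import math

C_KM_S = 299792.458      # speed of light in km/s
MPC_CM = 3.0857e24       # centimetres per megaparsec


def luminosity_distance_mpc(z: float, h0: float = 73.0,
                            omega_m: float = 0.27, steps: int = 1000) -> float:
    """Luminosity distance in Mpc for a flat Lambda-CDM cosmology."""
    dz = z / steps
    integral = 0.0
    for i in range(steps):
        zi = (i + 0.5) * dz  # midpoint of each redshift slice
        e_z = math.sqrt(omega_m * (1.0 + zi) ** 3 + (1.0 - omega_m))
        integral += dz / e_z
    return (1.0 + z) * (C_KM_S / h0) * integral


def line_luminosity_erg_s(flux_erg_s_cm2: float, distance_mpc: float) -> float:
    """Isotropic line luminosity L = 4 pi d^2 F."""
    d_cm = distance_mpc * MPC_CM
    return 4.0 * math.pi * d_cm ** 2 * flux_erg_s_cm2


d_l = luminosity_distance_mpc(0.025734)
l_halpha = line_luminosity_erg_s(3.4e-15, 108.0)  # assumed flux exponent
print(f"d_L = {d_l:.1f} Mpc, L_Halpha = {l_halpha:.2e} erg/s")
```

Small differences from the quoted luminosity are expected, because the distance and flux are rounded in the text.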

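We check for self-consistency by calculating the electron density
and, hence, the Thomson optical depth

\[
\tau = 0.3 \left(\frac{\dot{M}}{0.01\,M_{\odot}\ \mathrm{yr^{-1}}}\right) \left(\frac{v_{\mathrm{w}}}{500\ \mathrm{km\ s^{-1}}}\right)^{-1} \left(\frac{r}{10^{15}\ \mathrm{cm}}\right)^{-1}
\]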
Figure 2 | Emission line velocity structure at 15.5 h. The strongest lines
(Hα, Hβ, He II λ5,411 and the N IV λ7,115 complex) show broad wings


expected at this radius, and require it to be lower than τ = 1 for the line
emission to escape. We find that this self-consistency requirement places
a lower limit of r = 2 × 10¹⁴ cm on the radius of the line-formation region,
and implies substantial mass loss rates, Ṁ > 0.03 M☉ yr⁻¹. If we inter-
pret the disappearance of essentially all emission lines from our day-6
spectrum (Fig. 1b) as evidence that the wind was swept up by the expand-
ing ejecta (moving at 10⁴ km s⁻¹), the radius of the line-emitting region
must be r < 5.2 × 10¹⁴ cm, which is fully consistent with our estimates.
We can then calculate the total H mass by integrating over r:


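\[
M_{\mathrm{tot}} = 0.006 \left(\frac{\dot{M}}{0.01\,M_{\odot}\ \mathrm{yr^{-1}}}\right) \left(\frac{v_{\mathrm{w}}}{500\ \mathrm{km\ s^{-1}}}\right)^{-1} \left(\frac{r}{10^{15}\ \mathrm{cm}}\right) M_{\odot}
\]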


This indicates a range of 0.0008 M☉ < M_tot < 0.0035 M☉ for the range of
permitted H masses. Assuming that the typical H abundance for WN(h)
stars (~20%) applies, the total wind mass (dominated by He) can be
estimated to be several times larger than these values. Detailed simula-
tions’’ show that as little as 0.1 M☉ of He-dominated CSM would result
in strong spectroscopic interaction signatures (that we do not observe),
consistent with our derived total masses.
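
To make the chain of estimates above concrete, the sketch below evaluates the three scaling relations (mass loss rate, Thomson optical depth and total H mass) at the two limiting radii, 2 × 10¹⁴ cm and 5.2 × 10¹⁴ cm. The exponents used (1/2 on the luminosity and radius terms of the mass loss rate, -1 on the velocity and radius terms of the optical depth, and a total mass proportional to Ṁ r / v_w), the 10¹⁵ cm normalization and the value L_Hα = 4.8 × 10³⁹ erg s⁻¹ are a reconstruction of the relations and should be treated as assumptions.

```python
import math

L_HALPHA = 4.8e39   # erg/s, assumed H-alpha line luminosity
V_WIND = 2500.0     # km/s, wind velocity adopted in the text


def mass_loss_rate(l_halpha: float, v_w_km_s: float, r_cm: float) -> float:
    """H mass loss rate in solar masses per year (assumed scaling)."""
    return (0.01 * math.sqrt(l_halpha / 2.0e39)
            * (v_w_km_s / 500.0) * math.sqrt(r_cm / 1.0e15))


def thomson_depth(mdot: float, v_w_km_s: float, r_cm: float) -> float:
    """Thomson optical depth of the wind outside radius r_cm (assumed scaling)."""
    return 0.3 * (mdot / 0.01) / (v_w_km_s / 500.0) / (r_cm / 1.0e15)


def total_h_mass(mdot: float, v_w_km_s: float, r_cm: float) -> float:
    """Total H mass in solar masses out to r_cm (assumed scaling)."""
    return 0.006 * (mdot / 0.01) / (v_w_km_s / 500.0) * (r_cm / 1.0e15)


def summarize(r_cm: float):
    """Return (mass loss rate, optical depth, total H mass) at radius r_cm."""
    mdot = mass_loss_rate(L_HALPHA, V_WIND, r_cm)
    return mdot, thomson_depth(mdot, V_WIND, r_cm), total_h_mass(mdot, V_WIND, r_cm)


for radius in (2.0e14, 5.2e14):
    mdot, tau, m_tot = summarize(radius)
    print(f"r = {radius:.1e} cm: Mdot = {mdot:.3f} Msun/yr, "
          f"tau = {tau:.2f}, M_tot = {m_tot:.4f} Msun")
```

Under these assumptions the wind velocity cancels out of both τ and M_tot, because Ṁ scales linearly with v_w; this is one way to see why the results are insensitive to the adopted velocity.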

We conclude that we have directly detected a Wolf-Rayet-like wind
from the supernova progenitor with a WN(h) spectral class, indicating
a low H mass fraction. Assuming that the wind composition we measure
represents the surface composition of the progenitor star, our observa-
tions indicate that some members of the spectroscopic WN(h) Wolf-
Rayet class explode after having lost most of their hydrogen envelope,
exposing the CNO-processed, N-rich He layer below. Analysis of
photometric and spectroscopic follow-up observations (A.G.-Y. et al.,
manuscript in preparation) confirms that the explosion was indeed a 


extending out to ~2,500 km s⁻¹. Other weaker lines are narrow
and unresolved.


supernova of type IIb (Fig. 1c), as expected if the progenitor was a
massive star that lost all but ~0.1 M☉ of its H envelope”.

Our observations have interesting implications. First, we note that 
the derived values of the mass loss rate and emission-line-region size 
are quite extreme compared with known Wolf-Rayet observations and 
radiatively driven models”, including models with clumpy, inflated 
atmospheres”. This suggests that the mass loss rate from the progen- 
itor star may have increased shortly (of order 1 yr for the assumed velo- 
cities) before its explosion. Interestingly, such pre-supernova activity may 
be explained by recent wave-driven models’, or a more extreme envelope 
inflation” may be indicated. These data can thus provide a key diagnostic 
of the final stages of nuclear core burning in massive stars, which are 
currently poorly understood, with possible implications for the explo- 
sion mechanism itself. In any case, the star probably exploded inside a 
thick wind, and the explosion shock may have broken out from the 
opaque inner wind rather than from the hydrostatic surface of the star’. 

Our finding is in general accord with some previous work on type IIb 
supernova progenitors. Direct imaging of the progenitor of SN 2008ax 
(ref. 23) is consistent with a WN(h) progenitor. Furthermore, increased 
mass loss during the final year before explosion may inflate the appar- 
ent photospheric radius of the pre-supernova star, making stars with 
compact cores appear to have extended (low-mass) envelopes” and pos- 
sibly reconciling the conflicting findings about the progenitor of the 
type IIb supernova SN 201 1dh (refs 24-27). Regardless of the exact mech- 
anism, our observations suggest that substantial Wolf—Rayet-like winds 
pre-date at least some type IIb supernovae. A strong metallicity depen- 
dence of this process may explain the trend in the type IIb/type Ib 
supernova number ratio with host-galaxy metallicity**. Future studies 
of numerous additional supernova progenitors via their spectroscopic 
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wind signatures (Methods) would provide powerful constraints on the 
final stages of massive-star evolution. 


METHODS SUMMARY 

Photometry. The iPTF survey telescope was used to obtain r-band observations. 
Photometry was measured using our custom pipeline performing point spread func- 
tion (PSF) photometry on iPTF images after removing a reference image con- 
structed from pre-explosion data using image subtraction. Swift ultraviolet absolute 
AB magnitudes (Extended Data Fig. 1) were measured using standard pipeline reduc- 
tion and were corrected for host-galaxy contamination using late-time Swift images. 
Spectroscopy. Our earliest (15.5 h) and latest (69 d) spectra were obtained using 
the DEIMOS spectrograph mounted on the Keck II 10-m telescope using the grat- 
ing with 600 lines per millimetre and an exposure time of 600 s. Additional spectra 
were obtained using ALFOSC mounted on the 2.56-m NOT telescope, LRS mounted 
on the 10.4-m HET telescope (day 3), and LRIS mounted on the Keck I 10-m tele-
scope (day 6). All spectra were reduced using standard pipelines and are digitally
available on WISeREP”. The method of flash spectroscopy is described in detail
in Methods. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS


Photometry. r-band images were obtained by the iPTF survey camera mounted 
on the Palomar 48-inch Schmidt telescope’. Photometry was measured using 
our custom pipeline performing PSF photometry on iPTF images after removing a 
reference image constructed from pre-explosion data using image subtraction. 
Swift ultraviolet absolute AB magnitudes (Extended Data Fig. 1) were measured 
using standard pipeline reduction®’ and were corrected for host-galaxy contam- 
ination using late-time Swift images. 

Spectroscopy. Our earliest (15.5-h) and latest (69-d) spectra were obtained using
the Deep Imaging Multi-Object Spectrograph (DEIMOS*) mounted
on the Keck II 10-m telescope using the 600 line mm⁻¹ grating and an exposure
time of 600 s. Additional spectra were obtained using ALFOSC mounted on the
2.56-m NOT telescope, LRS mounted on the 10.4-m HET telescope (day 3) and the
Low Resolution Imaging Spectrometer*’ (LRIS) mounted on the Keck I 10-m
telescope (day 6). All spectra were reduced using standard pipelines and are digit-
ally available on WISeREP”’.

Spectroscopic observations are presented in Fig. 1. In Fig. 1a, the early spectrum 
of SN 2013cu reveals Wolf-Rayet wind features. The spectrum (black) is com- 
pared with WNL models’ (red and grey curves, offset vertically for clarity) showing 
remarkable similarity, both in line features (major species marked; strong He and 
N lines accompanied by Balmer lines indicate a WN6(h) classification) and in the 
continuum shape (demonstrated by overplotting the 56-kK model on the spectrum). 
The similarity in continuum shape to hot model spectra constrains any dust red- 
dening to be minimal, indicating that any pre-existing circumstellar dust must 
have been destroyed; compare with the observed spectrum of the WN6(h) star 
HD 192163 (blue). Consistent with this conclusion, we detect no trace of Na D 
absorption lines. Figure 1b shows the emission line evolution during the first week. 
We replot the first spectrum (red), compared with spectra obtained ~3 d after explo- 
sion (using ALFOSC (blue) and LRS (cyan)) and ~6d after explosion (magenta; 
additional details in A.G.-Y. et al., manuscript in preparation). Spectra were scaled 
and offset for clarity with respect to the high-quality Keck/LRIS spectrum. Con- 
tinuum shapes are identical (within calibration uncertainties) and consistent with 
the Rayleigh-Jeans tail of a hot Planck curve, indicating that the black-body peak 
remains in the ultraviolet at 6 d past explosion. By day 3, the initially strong Wolf-
Rayet features disappear, with the exception of weaker He II lines and the C III/N III
complex around 4,640 Å (marked), whereas the Hα line remains constant at 3.4 ×
10⁻¹⁵ erg s⁻¹ cm⁻² (with decreased equivalent width due to higher continuum level).
The spectrum with a high signal-to-noise ratio on day 6 is almost featureless, except
for weak Hα emission (inset) with an intensity less than a tenth that of day 3, pro-
bably because the line-forming region has been cleared by the expanding ejecta.
Figure 1c shows evidence that SN 2013cu is of type IIb. A spectrum of SN 2013cu
69 d after explosion (black) is compared with the prototypical type IIb supernovae
SN 1993J at age 60 d (red) and SN 2011dh at age 43 d (green), and with the typical
non-stripped type II-P supernova SN 2004et at age 45 d (blue). To allow for slight age
differences and expansion-velocity variations, we align all spectra in wavelength
using the weak blue-shifted Fe II lines at 4,924, 5,018 and 5,169 Å (marked with black
vertical lines), because they are good tracers of the photosphere. Supernovae SN
2013cu, SN 2011dh and SN 1993J exhibit strong He I absorption at 5,876, 7,065 and
7,281 Å (marked with black vertical lines), whereas the weaker 6,678 Å absorption is
similar in SN 2013cu and SN 2011dh. These He I lines are not detected in the type II-
P supernova spectrum at all. However, SN 2013cu shows weaker Balmer absorption
(marked with magenta vertical lines), and the Hα absorption is not clearly defined.
Both the spectral similarity to supernovae SN 1993J and SN 2011dh and the strong He I
lines relative to H indicate that SN 2013cu is spectroscopically a type IIb supernova.
Flash spectroscopy. We define as ‘flash spectroscopy’ the technique of obtaining 
a set of spectroscopic data shortly enough after a supernova explosion that the 
observed spectrum is dominated by features related directly to the effects of the 
shock breakout flash. In particular, flash-ionized CSM recombines and forms
strong emission lines, revealing, for example, the elemental abundance and, thus, 
the Wolf-Rayet class of a supernova progenitor. In addition, emission line spectra 
provide a handle on the early temperature evolution, which is difficult to measure 
even using Swift ultraviolet photometry because the black-body peak is initially too 
far into the ultraviolet. This study provides strong motivation for future investi- 
gations using dedicated rapid-response spectrographs, such as FLOYDS*™ and 
SEDM”, responding to real-time triggers from high-cadence wide-field surveys’. 

Although Wolf-Rayet supernova progenitor stars are difficult to study using 
pre-explosion imaging (owing both to intrinsic low luminosity in the optical and 
infrared bands'*”* and to the possible confusing effect of a bright O/B companion’), 
we demonstrate that they are amenable to study using the flash spectroscopy 
method. WRSs belonging to the WN(h) class may have the most extensive 
winds'*. Application of this method to WC and WO stars may require flash spec- 
troscopy at even earlier epochs (~1 h after explosion), before the supernova ejecta 
sweep up the high-density wind. Reducing the latency between the explosion 
and the spectroscopy by an order of magnitude relative to our observations of 
SN 2013cu is possible using recently commissioned instrumentation (Extended 
Data Fig. 2). 

Unlike studies of supernova progenitors through pre-explosion imaging, the 
flash spectroscopy method can be applied to relatively distant objects (SN 2013cu 
is located 108 Mpc away, well beyond the 20-Mpc distance typical for pre-explosion 
studies) and to events in galaxies having no high-quality pre-explosion imaging, 
such as a large population of little-studied dwarf galaxies. Judging from local super-
nova rate measurements*’, ~300 explosions occur within 100 Mpc of Earth every 
year and can be potentially studied in this manner. The method thus allows routine 
spectroscopic studies of supernova progenitors that were previously only possible 
by extreme serendipity (for example for the progenitor of SN 1987A; ref. 38). 
Within a few years, there will be enough data for the flash spectroscopy method 
to be used to chart wind signatures from numerous supernova progenitors; in 
particular, it will be possible to study systematically the Wolf-Rayet progenitor 
population of stripped supernovae. We thus expect that this method will be 
broadly applied in the coming years. 
Extended Data Figure 1 | The r-band light curve of SN 2013cu. A parabolic
model of the flux-time (red solid curve) describes the pre-peak data (1σ
error bars) very well. Backward extrapolation indicates an explosion date of
UTC 2013 May 2.93 ± 0.11 (MJD = 56414.93; 5.7 h before the first iPTF
detection; see inset); we estimate the uncertainty from the scatter generated by
modifying the subset of points used in the fit. Our first Keck spectrum was
obtained about 15.5 h after explosion (vertical dotted line). Early Swift
ultraviolet photometry (diamonds) places a lower limit of T = 25,000 K on the
black-body temperature measured 40 h after explosion.
Extended Data Figure 2 | Flash spectroscopy: rapid spectroscopic
observations of supernovae during or shortly after shock breakout. This
simulated SEDM spectrum (red) created by downgrading the observed Keck
spectrum (black; resolution, R = 2,000) to the coarse SEDM resolution
(R = 100) shows that the strong Wolf-Rayet lines (in this case the marked H,
He and N lines) are still easily detectable and allow us to determine the Wolf-
Rayet spectroscopic class. The SEDM is an IFU low-resolution spectrograph
designed for robotic response to transient events, to be mounted on the
Palomar 60-inch telescope almost continually. Responding to real-time triggers
from the iPTF survey operating on the same mountain, this instrument should
be able to obtain low-resolution spectra within ~1 h of object detection.
Because SEDM operates on a smaller telescope than Keck, SEDM data of
similar quality to the simulated spectrum will require a relatively long
integration. However, SEDM will be able to observe objects with much reduced
latency, thus benefiting from stronger line intensities expected from the
stronger shock breakout flash luminosity processed by a denser wind close to
the progenitor star, potentially compensating for its reduced absolute
sensitivity.
Extended Data Figure 3 | Comparison with early ‘Wolf-Rayet’ spectra of
SN 1998S. Wolf-Rayet-like features similar to those we observed were
previously noted in two cases, SN 1983K (refs 39,40) and SN 1998S (ref. 16),
and persisted for many days after explosion. The spectra of SN 1983K, classified
as a type II-P supernova, are unfortunately not available for comparison.
Spectra of SN 1998S (type IIn) are shown here. The spectra have a similar blue
continuum slope and a similar Hα profile. The He II λ5,411 and the N IV λ7,115
complex are weaker in SN 1998S, and the strong lines of N and C (5,806 Å)
are broad, consistent with an origin in shocked CSM rather than in an
undisturbed Wolf-Rayet wind. The inset shows a close-up view of the strong
N IV λ7,115 complex. N emission from SN 1998S is weak compared with
He I λ7,065 and shows a smooth, broad profile, whereas SN 2013cu exhibits
a broad base (full-width at half-maximum, 2,500 km s⁻¹) as well as strong and
narrow (unresolved) N IV lines. These observations are consistent with an
origin for the N emission of SN 1998S in shocked (and perhaps N-rich) CSM,
whereas the narrow He I lines may come from a more distant, photo-ionized
wind. In SN 2013cu even the narrow N lines are much stronger than He,
indicating a wind with a Wolf-Rayet-like composition at all radii.
Practical quantum key distribution protocol without
monitoring signal disturbance


Toshihiko Sasaki, Yoshihisa Yamamoto & Masato Koashi


Quantum cryptography exploits the fundamental laws of quantum mechanics to
provide a secure way to exchange private information. Such an exchange
requires a common random bit sequence, called a key, to be shared secretly
between the sender and the receiver. The basic idea behind quantum key
distribution (QKD) has widely been understood as the property that any attempt
to distinguish encoded quantum states causes a disturbance in the signal. As a
result, implementation of a QKD protocol involves an estimation of the
experimental parameters influenced by the eavesdropper’s intervention, which
is achieved by randomly sampling the signal. If the estimation of many
parameters with high precision is required, the portion of the signal that is
sacrificed increases, thus decreasing the efficiency of the protocol. Here we
propose a QKD protocol based on an entirely different principle. The sender
encodes a bit sequence onto non-orthogonal quantum states and the receiver
randomly dictates how a single bit should be calculated from the sequence. The
eavesdropper, who is unable to learn the whole of the sequence, cannot guess
the bit value correctly. An achievable rate of secure key distribution is
calculated by considering complementary choices between quantum measurements
of two conjugate observables. We found that a practical implementation using a
laser pulse train achieves a key rate comparable to a decoy-state QKD
protocol, an often-used technique for lasers. It also has a better tolerance
of bit errors and of finite-sized-key effects. We anticipate that this finding
will give new insight into how the probabilistic nature of quantum mechanics
can be related to secure communication, and will facilitate the simple and
efficient use of conventional lasers for QKD.

In a QKD protocol, the sender Alice and the receiver Bob repeat transmission
of quantum signals and accumulate raw bits of data through quantum
measurements. Using public communication, each of them discards the apparently
useless portion of the raw data to form a bit sequence called a sifted key.
The sifted key of length N is then processed into the final key of a shorter
length through error reconciliation and privacy amplification. Denoting the
costs of the two procedures as $H_{\mathrm{ER}}$ and $H_{\mathrm{PA}}$, the net
production length G of the secure final key is given by

$$G = N(1 - H_{\mathrm{ER}} - H_{\mathrm{PA}}) \qquad (1)$$

When the bit errors between Alice’s and Bob’s sifted key occur at a rate
$e_{\mathrm{bit}}$, the ideal cost of error reconciliation in the asymptotic
limit of large N is given by $H_{\mathrm{ER}} = h(e_{\mathrm{bit}})$ as Shannon
entropy, with $h(x) := -x\log_2 x - (1 - x)\log_2(1 - x)$. The cost
$H_{\mathrm{PA}}$ depends on how much information on the sifted key has leaked
to an eavesdropper Eve. For example, it is given by
$H_{\mathrm{PA}} = h(e_{\mathrm{bit}})$ in a simple proof for the
Bennett-Brassard 1984 (BB84) protocol. In general, the formula varies depending
on protocols and security proofs, and parameters other than $e_{\mathrm{bit}}$
are often monitored in the protocol and enter into the formula of
$H_{\mathrm{PA}}$. Nevertheless, so far $H_{\mathrm{PA}}$ has always been an
increasing function of the amount of disturbance. This implies that the
conventional QKD protocols inherently rely on the original version of
Heisenberg’s uncertainty principle, which dictates that the more information
Eve has obtained, the more disturbance she should have caused on the signal.
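
As a rough illustration of equation (1), the following sketch evaluates the key fraction G/N for the simple BB84 proof quoted above, in which both costs equal the binary entropy of the bit-error rate. The function names are ours and purely illustrative; they are not part of the protocol description.

```python
import math


def binary_entropy(x: float) -> float:
    """Binary Shannon entropy h(x) in bits, with h(0) = h(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1.0 - x) * math.log2(1.0 - x)


def key_fraction(h_er: float, h_pa: float) -> float:
    """Equation (1) divided by N: G/N = 1 - H_ER - H_PA."""
    return 1.0 - h_er - h_pa


def bb84_key_fraction(e_bit: float) -> float:
    """Simple BB84 proof: both costs equal h(e_bit)."""
    h = binary_entropy(e_bit)
    return key_fraction(h, h)


if __name__ == "__main__":
    for e in (0.01, 0.05, 0.10, 0.12):
        print(f"e_bit = {e:.2f}: G/N = {bb84_key_fraction(e):+.3f}")
```

In this simple model the key fraction changes sign at a bit-error rate of about 11%, because the privacy amplification cost grows with the disturbance.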


What we propose here is an entirely new approach to establishing 
private correlations between Alice and Bob under the presence of an eaves- 
dropper Eve, in which the leaked information to Eve is bounded regard- 
less of the disturbance that she causes on the quantum signal. The main 
idea is to encode many raw key bits on quantum systems coherently such 
that only a few bits can be read out at the same time, which enables Bob 
to specify randomly how the sifted key bit is calculated from the raw 
key bits. This randomness makes it hard for Eve to guess the calculated 
bit from what little knowledge on the raw key bits she has acquired. 

Let us explain our QKD protocol in more detail using the schematics
shown in Fig. 1a. The protocol proceeds as follows. (I) Alice encodes a
random L-bit sequence $s_1 s_2 \cdots s_L$ on a weak signal. To convey the
basic idea, here we assume that the encoded signal is a single-photon state
of L optical pulses

$$|\Psi_1\rangle = \frac{1}{\sqrt{L}} \sum_{k=1}^{L} (-1)^{s_k} |k\rangle \qquad (2)$$

where the photon is in the kth pulse for state $|k\rangle$. (II) After possible
intervention by Eve, Bob receives the signal. (III) An independent random


Figure 1 | Basic idea behind the proposed QKD scheme. a, b, Quantum
signals flow through thick lines and classical ones through thin lines, in the
order indicated by the Roman numerals. Eve tries to guess Alice’s bit
$s_A = s_i \oplus s_j$ in both figures, where indices {i, j} are announced by
Bob. In a, Bob conducts measurement M following random number generator RNG to
guess $s_A$. In b, Bob conducts measurement M′ prior to RNG, making it hard for
anyone to guess $s_A$. As the procedures to generate indices {i, j} in both
figures are identical, every strategy used by Eve in a should work equally
well in b.
number generator (RNG) announces a random value $r \in \{1, \ldots, L-1\}$.
(IV) Through an optical interference measurement M, Bob tries to determine the
value of $s_i \oplus s_j$ for a pair of indices
$\{i, j\} \subset \{1, \ldots, L\}$ satisfying $j - i = \pm r \pmod{L}$. Here
the symbol $\oplus$ denotes summation modulo 2. In measurement M, Bob splits
each pulse by a half beam-splitter and then superposes the kth and the k′th
half pulses ($k' = k + r \pmod{L}$, $k = 1, \ldots, L$) to measure the phase
difference by detecting a photon. Whenever a photon is detected from the
superposed ith and jth pulses, Bob announces {i, j} and records the measured
phase difference as his sifted key bit $s_B$. Alice records
$s_A = s_i \oplus s_j$ as her sifted key bit. As shown in the Methods, if Bob
receives the state $|\Psi_1\rangle$ intact, he learns $s_A$ without errors.
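
The single-photon version of steps (I) to (IV) can be checked numerically. The sketch below is a minimal simulation under our own assumptions (0-based pulse indices, no channel loss, no eavesdropper, illustrative function names): it builds the amplitudes of equation (2), evaluates the probability of every outcome (k, s) of measurement M, and confirms that the only outcomes with non-zero probability are those with s equal to the XOR of the two interfering bits.

```python
import numpy as np


def single_photon_state(bits):
    """Amplitudes of equation (2): (-1)^{s_k} / sqrt(L) on pulse k (0-based)."""
    bits = np.asarray(bits)
    return (-1.0) ** bits / np.sqrt(len(bits))


def outcome_probabilities(psi, r):
    """Probability of each outcome (k, s) of measurement M for delay r.

    Outcome (k, s) corresponds to the projection onto
    (|k> + (-1)^s |k + r mod L>) / sqrt(2) and occurs with probability
    |<k,s|psi>|^2 / 2.
    """
    L = len(psi)
    probs = np.zeros((L, 2))
    for k in range(L):
        partner = (k + r) % L
        for s in (0, 1):
            overlap = (psi[k] + (-1) ** s * psi[partner]) / np.sqrt(2)
            probs[k, s] = abs(overlap) ** 2 / 2
    return probs


def check_protocol(L=8, seed=0):
    """Return True if Bob's bit always equals s_i XOR s_j for every delay r."""
    rng = np.random.default_rng(seed)
    bits = rng.integers(0, 2, size=L)
    psi = single_photon_state(bits)
    for r in range(1, L):
        probs = outcome_probabilities(psi, r)
        if not np.isclose(probs.sum(), 1.0):
            return False
        for k in range(L):
            wrong = 1 - (bits[k] ^ bits[(k + r) % L])
            if probs[k, wrong] > 1e-12:
                return False
    return True


if __name__ == "__main__":
    print(check_protocol())
```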

We are now interested in how well Eve can guess the value of $s_A$.
Figure 1a alone is not conclusive in this regard, because she has some
control over the decision process of the indices {i, j} through feeding a
modified signal to Bob at step (II). To show that Eve’s control is quite
limited, consider another measurement procedure by Bob shown in
Fig. 1b. In measurement M′, Bob simply measures the location of the
photon in the incoming L pulses to determine one of the indices, i. He
also generates a random bit b. Subsequently, the RNG announces r,
which determines the other index as $j = i + (-1)^b r \pmod{L}$. As is proved
in the Methods, this procedure is equivalent to M as far as the production
of outcome {i, j} is concerned. Hence, it suffices to show Eve’s ignorance
of $s_A$ in Fig. 1b.

An intuitive reasoning for the ignorance is given as follows. Alice has
emitted just one photon, so most of the L bits should be unknown to
Eve when $L \gg 1$. In Fig. 1b, Eve’s intervention affects only the decision
of index i, and the other index j is chosen randomly from the rest of the
$L - 1$ bits through the random number r. We may thus expect that Eve
has little information on $s_j$ and hence on $s_A = s_i \oplus s_j$.

What is remarkable here is that the above argument has no reference 
to how much Eve has disturbed the signal received by Bob. To make a 
rigorous security proof, we have only to show that Alice’s sifted key bit 
$s_A$ in Fig. 1b can be accumulated and converted to a secure final key,
based on the fact that r is random and independent of i. There is no 
need to mention directly the state fed to Bob by Eve in the proof, and it 
is still valid for any attack strategy by Eve. 

The difficulty in guessing the value of randomly chosen bit $s_j$ appearing
in Fig. 1b has been discussed in a slightly different context and called
the information causality. Our QKD scheme may be regarded as the
combination of the information causality, which holds for classical and
quantum signals alike, and the complementarity, which is unique to quantum
mechanics. Bob’s measurement in Fig. 1a reveals the phase difference, a
wave-like property, while that in Fig. 1b identifies the location of
the photon, a particle-like property. In quantum mechanics, such different
measurements may result in incompatible consequences. In fact,
Bob learns $s_A$ in Fig. 1a, whereas in Fig. 1b the information causality
forbids anyone from learning $s_A$, including Bob. The mere possibility of
Bob’s choosing the latter prevents Eve from learning $s_A$ even if Bob has
actually chosen the former.

It is also worth mentioning how our QKD protocol differs from the 
B92 protocol. They are similar in many respects. In both, Alice encodes
the bit values on non-orthogonal states. Bob dictates which of the bits 
should be used. The dictation may be tampered with by Eve via modifi- 
cation of the signals, because it is based on the outcomes of Bob’s measure- 
ment on them. It is, however, only our protocol that has a complementary 
scenario shown in Fig. 1b, which substantiates the existence of inherent 
randomness in Bob’s dictation, beyond the reach of Eve’s tampering. 

The above basic idea can be implemented simply by a weak coherent
laser pulse train as a light source and a variable-delay interferometer at
the receiver (Fig. 2), which we name the round-robin differential phase-shift
(RRDPS) QKD protocol. The setup is exactly the same as the differential
phase-shift QKD protocol except that the fixed delay line
in the original is replaced by a variable delay line. For a security proof,
we adopt a simple characterization of the source about the total photon
number $\nu$ in the L-pulse train stated in the form of
Figure 2 | Practical implementation of the proposed QKD scheme. Alice’s
laser emits a train of L pulses with interval T. She applies phase shift {0, π}
on each pulse according to a random bit sequence $s_1 \ldots s_L$. Bob splits
the received train into two beams and superposes them after a random delay
r′T ($r' \in \{1, \ldots, L-1\}$). Detection of a photon determines Bob’s
sifted key bit $s_B$, and he announces the indices {i, j} of the corresponding
pair of pulses. Alice adopts $s_A = s_i \oplus s_j$ as her sifted key bit. The
key rate is improved by applying random phase δ on each train (see the
Methods).


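$$\Pr(\nu > \nu_{\mathrm{th}}) \le e_{\mathrm{src}} \qquad (3)$$

with an integer threshold $\nu_{\mathrm{th}}$ and a constant
$e_{\mathrm{src}}$. Let Q be the empirical rate of detection
$Q := N/N_{\mathrm{em}}$ when a sifted key of length N is generated through
$N_{\mathrm{em}}$ rounds of transmitting L-pulse trains. Then we can derive
an asymptotic formula for the net production length of the secure key
(see the Methods)

$$G = N\left[1 - h(e_{\mathrm{bit}}) - \frac{e_{\mathrm{src}}}{Q} - \left(1 - \frac{e_{\mathrm{src}}}{Q}\right) h\!\left(\frac{\nu_{\mathrm{th}}}{L-1}\right)\right] \qquad (4)$$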


For clarity, let us consider the case where a nonclassical light source
with $e_{\mathrm{src}} = 0$ is used instead of the laser, for which

$$G = N\left[1 - h(e_{\mathrm{bit}}) - h\!\left(\frac{\nu_{\mathrm{th}}}{L-1}\right)\right] \qquad (5)$$

The case with $\nu_{\mathrm{th}} = 1$ corresponds to the state
$|\Psi_1\rangle$ used in the explanation of the basic idea. In equation (5),
the third term $H_{\mathrm{PA}} = h(\nu_{\mathrm{th}}/(L - 1))$ is a constant,
which is in stark contrast with the conventional QKD protocols, for which
$H_{\mathrm{PA}}$ depends on the disturbance. The constant value of
$H_{\mathrm{PA}}$ leads to two advantages of the new QKD protocol: (1) It has
high tolerance of bit errors. For example, for L = 128 and
$\nu_{\mathrm{th}} = 1$, G is positive up to about $e_{\mathrm{bit}} = 0.35$.
There is no fundamental limit on the error threshold smaller than 50%. (2) The
secrecy of the final key is established after shortening the key length via
privacy amplification by a fixed and predetermined fraction $H_{\mathrm{PA}}$.
There is no need to sacrifice a randomly chosen subset of signals to estimate
an appropriate value of $H_{\mathrm{PA}}$, which affects the rate of
finite-sized key generation.
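
The error tolerance quoted for equation (5) can be reproduced with a few lines of arithmetic. The sketch below, with illustrative function names of our own, evaluates G/N for the ideal source and locates by bisection the bit-error rate at which the key fraction reaches zero.

```python
import math


def binary_entropy(x: float) -> float:
    """Binary Shannon entropy h(x) in bits, with h(0) = h(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1.0 - x) * math.log2(1.0 - x)


def key_fraction_ideal(e_bit: float, L: int, nu_th: int) -> float:
    """Equation (5) divided by N, for a source with e_src = 0."""
    return 1.0 - binary_entropy(e_bit) - binary_entropy(nu_th / (L - 1))


def error_threshold(L: int, nu_th: int, tol: float = 1e-9) -> float:
    """Bit-error rate in [0, 1/2] at which the key fraction reaches zero."""
    lo, hi = 0.0, 0.5
    if key_fraction_ideal(lo, L, nu_th) <= 0.0:
        return 0.0
    # h is increasing on [0, 1/2], so the key fraction is decreasing there.
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if key_fraction_ideal(mid, L, nu_th) > 0.0:
            lo = mid
        else:
            hi = mid
    return lo


if __name__ == "__main__":
    for L in (16, 32, 128):
        print(L, round(error_threshold(L, 1), 4))
```

For L = 128 and a threshold photon number of 1 the bisection lands close to 0.35, and the threshold moves towards 50% as L grows, since the constant privacy amplification cost shrinks.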

For the use of weak coherent pulses (WCPs) from a conventional laser,
we show examples of asymptotic key rates per pulse, $G/(L N_{\mathrm{em}})$,
as a function of channel transmission $\eta$ in Fig. 3. For ground-based
transmission, an optical fibre of 50 km decreases $\eta$ by a factor of 10,
whereas $\eta$ = 10 * to 10 ° is expected for satellite-based transmission.
Figure 3 also shows rates of the BB84 protocol for comparison. When WCPs with
the second-order correlation $g^{(2)}(0) = 1$ or a realistic single-photon
source with $g^{(2)}(0) = 0.01$ are used for BB84, the multi-photon emission
from the source is exploited by Eve via photon-number-splitting attacks,
resulting in a poor key-rate scaling of $O(\eta^2)$. The present scheme with
WCPs has a better scaling, close to $O(\eta)$, and surpasses the WCP-based
or the single-photon-based BB84 protocol for small $\eta$.

There is a popular technique called decoy-state QKD, in which
pulses with different amplitudes are randomly mixed in the signal to
monitor the photon-number-splitting attacks. It stands in sharp contrast
to our protocol. Figure 3 also shows the asymptotic key rate for the
ideal decoy-state BB84 protocol, in which the statistics of single-photon
Figure 3 | Key rates versus channel transmission. a, The rates for
$e_{\mathrm{bit}} = 0.03$. b, The rates for $e_{\mathrm{bit}} = 0.11$. Lines
labelled (i)-(iii) represent the proposed protocol with L = 128, 32 and 16.
The rates are optimized over the choice of $\nu_{\mathrm{th}}$ and the mean
photon number $\mu$ of a WCP through the relation
$Q = L\eta\mu\, e^{-L\eta\mu}/2$ and
$e_{\mathrm{src}} = 1 - \sum_{\nu=0}^{\nu_{\mathrm{th}}} e^{-L\mu}(L\mu)^{\nu}/\nu!$.
The optimized value of $\mu$ is around 0.05 for line (i) when $\eta < 0.01$.
Lines labelled (iv)-(vi) represent BB84 protocols with double-pulse phase
coding, using WCPs (iv), realistic single-photon source with the second-order
correlation $g^{(2)}(0) = 0.01$ (v), and WCPs with infinite decoy states (vi).


emission events are precisely characterized via decoy signals. Although
the asymptotic rate is better than our protocol by one order of magnitude,
for a finite-sized key the decoy-state BB84 protocol suffers from a
trade-off between the overhead of processing a large-sized key and the
inefficiency of inserting many decoy signals to reach a required accuracy
in the estimation of parameters. Our protocol is much simpler
in this regard, requiring no sampling for determining $H_{\mathrm{PA}}$. As a
result, a positive key rate is achieved even with N being as small as $10^3$
(see Methods). For a higher bit-error rate, our protocol becomes better than
the decoy-state BB84 even in the asymptotic limit, owing to its high
tolerance of errors. If we consider the use of modern digital coherent
communication systems with 40 Gbit s⁻¹ differential phase-shift signals
and assume the receiver’s overall detection efficiency to be 10%, we
can generate a secure key at a rate of 200 bit s⁻¹ for a channel length of
200 km and an error rate of 11%.

The variable delay used in our scheme will be implemented as a series 
of switchable optical delay lines of T, 2T, 4T, 8T, ..., where T is the time 
interval between the neighbouring pulses. Because the delay is fixed for 
each train of L pulses, the switching speed can be much slower than T, 
and it affects only the duty ratio. 
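
Because the switchable lines have lengths T, 2T, 4T, 8T, ..., any delay rT corresponds to the binary expansion of r. The sketch below illustrates this; the function name is ours, and the number of lines (just enough to reach L − 1) is an assumption rather than a figure given in the text.

```python
def delay_line_settings(r: int, L: int) -> list:
    """Return the delay-line lengths (in units of T) switched in for delay r*T.

    Assumes binary-weighted lines T, 2T, 4T, ...; the line count is the
    smallest number that can reach L - 1 (our assumption).
    """
    if not 1 <= r <= L - 1:
        raise ValueError("r must satisfy 1 <= r <= L - 1")
    n_lines = (L - 1).bit_length()
    return [1 << b for b in range(n_lines) if (r >> b) & 1]


if __name__ == "__main__":
    print(delay_line_settings(5, 128))    # lines T and 4T
    print(delay_line_settings(127, 128))  # all seven lines
```

For L = 128 this needs seven lines (T up to 64T), and the setting changes only once per train of L pulses.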

The proposed QKD protocol demonstrates that spreading quantum 
information coherently over hundreds of quantum systems such as optical 
pulses provides a novel way of utilizing it for secure communication.
The fact that the quantum effect survives under large noise suggests 
that similar encoding techniques may be useful for other applications 
of quantum information working in the presence of noise. 


METHODS SUMMARY 


Bob’s alternative choices of measurements. Let $+_L$ denote summation modulo
L. When a single-photon input state $\rho$ is fed to measurement M, Bob
announces $\{k, k +_L r\}$ and obtains $s_B = s$ at probability
$\langle k,s|\rho|k,s\rangle/2$ with
$|k,s\rangle := (|k\rangle + (-1)^s |k +_L r\rangle)/\sqrt{2}$. Given that
$\langle k,s|\Psi_1\rangle = 0$ when $s \ne s_k \oplus s_{k +_L r}$, Bob’s
guess $s_B$ is always equal to $s_A$ if he has received state
$|\Psi_1\rangle$. The probability of announcing {i, j} (note that {j, i} is
regarded as the same value) is calculated to be
$P(\{i,j\}) = [P(i) + P(j)][\delta_{i +_L r,\, j} + \delta_{j +_L r,\, i}]/2$,
where $P(k) = \langle k|\rho|k\rangle$ is the probability of finding a photon
in the kth pulse.

The calculation of P({i, j}) for the case of Fig. 1b also leads to the same
expression. This shows that the relation between the quantum signal received
from Eve and the announced value {i, j} is identical for Fig. 1a and Fig. 1b.

Derivation of secure key rates. The random phase shift δ enables Alice to tag
each of the rounds with $\nu > \nu_{\mathrm{th}}$ in principle. We assume that
this tagged portion, at most $N e_{\mathrm{src}}/Q$ bits, is fully leaked to
Eve, leading to the $-e_{\mathrm{src}}/Q$ term in equation (4).

For the untagged portion, it can be shown that the sequence
$s_1 s_2 \cdots s_L$ is equivalent to the outcome of
$\{|0\rangle, |1\rangle\}$-basis measurement on L qubits prepared in a
state fulfilling the promise that, if they are measured in a conjugate
$\{|+\rangle, |-\rangle\}$-basis, no more than $\nu_{\mathrm{th}}$ qubits are
found to be in the $|-\rangle$ state. The key bit $s_A = s_i \oplus s_j$
is then given by $\{|0\rangle, |1\rangle\}$-basis measurement on qubit j after
a controlled-NOT operation on qubits i and j. It can be shown that the
probability of finding qubit j in the state $|-\rangle$ is at most
$\nu_{\mathrm{th}}/(L - 1)$, leading to the remaining term in equation (4).


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS

Bob’s alternative choices of measurements. Let $+_L$ denote summation modulo
L, and $P(|\phi\rangle) := |\phi\rangle\langle\phi|$. Bob’s measurement M is
fully characterized by a set of operators

$$\hat{E}^{(r)}_{k,s} := \frac{1}{2} P\!\left(\frac{|k\rangle + (-1)^s |k +_L r\rangle}{\sqrt{2}}\right) \qquad (6)$$

where the probability of outcome (k, s) ($k \in \{1, \ldots, L\}$,
$s \in \{0, 1\}$) is given by $\mathrm{Tr}(\rho \hat{E}^{(r)}_{k,s})$
for single-photon input state $\rho$. From this outcome, he announces
$\{k, k +_L r\}$ (the order being irrelevant) and adopts $s_B = s$. Given that
$\langle\Psi_1|\hat{E}^{(r)}_{k,s}|\Psi_1\rangle = 0$ when
$s \ne s_k \oplus s_{k +_L r}$, Bob’s guess $s_B$ is always equal to $s_A$ if
he has received state $|\Psi_1\rangle$.


In measurement M, the probability P({i, j}) of announcing {i, j} (where
$i \ne j$) is given by
$\sum_s \mathrm{Tr}(\rho \hat{E}^{(r)}_{i,s})\,\delta_{i +_L r,\, j} + \sum_s \mathrm{Tr}(\rho \hat{E}^{(r)}_{j,s})\,\delta_{j +_L r,\, i}$,
where $\delta_{x,y}$ is 1 for x = y and 0 for $x \ne y$. This is calculated
to be

$$P(\{i,j\}) = [P(i) + P(j)]\,[\delta_{i +_L r,\, j} + \delta_{j +_L r,\, i}]/2 \qquad (7)$$

where $P(k) = \langle k|\rho|k\rangle$ is the probability of finding a photon
in the kth pulse.

In Fig. 1b, Bob announces $\{k, k +_L (-1)^b r\}$ ($k = 1, \ldots, L$;
b = 0, 1) at probability P(k)/2. Noticing that P({i, j}) is contributed from
the cases k = i and k = j, we see that it is also given by equation (7). This
shows that the relation between the quantum signal received from Eve and the
announced value {i, j} is identical for Fig. 1a and Fig. 1b. Therefore, if a
statement regarding Eve’s knowledge about Alice’s sifted key is proved for
Fig. 1b, it should also be true for Fig. 1a.
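
The equality of the two announcement distributions can also be checked numerically for an arbitrary single-photon input. The sketch below is only an illustration under our own conventions (0-based pulse indices, a randomly generated density matrix standing in for whatever Eve sends, hypothetical function names): it computes P({i, j}) for the interference measurement M and for the position measurement M′ at each delay r and compares them.

```python
import numpy as np


def random_single_photon_state(L, rng):
    """Random density matrix on the L-dimensional single-photon space."""
    a = rng.normal(size=(L, L)) + 1j * rng.normal(size=(L, L))
    rho = a @ a.conj().T
    return rho / np.trace(rho).real


def pair_distribution_M(rho, r):
    """P({i, j}) for interference measurement M with delay r."""
    L = rho.shape[0]
    dist = {}
    for k in range(L):
        j = (k + r) % L
        for s in (0, 1):
            v = np.zeros(L, dtype=complex)
            v[k] += 1 / np.sqrt(2)
            v[j] += (-1) ** s / np.sqrt(2)
            p = (v.conj() @ rho @ v).real / 2  # probability of outcome (k, s)
            key = frozenset((k, j))
            dist[key] = dist.get(key, 0.0) + p
    return dist


def pair_distribution_M_prime(rho, r):
    """P({i, j}) when Bob measures the photon position k and flips a bit b."""
    L = rho.shape[0]
    dist = {}
    for k in range(L):
        for b in (0, 1):
            j = (k + (-1) ** b * r) % L
            key = frozenset((k, j))
            dist[key] = dist.get(key, 0.0) + rho[k, k].real / 2
    return dist


def distributions_agree(L=6, seed=1):
    """Return True if M and M' announce {i, j} with identical probabilities."""
    rng = np.random.default_rng(seed)
    rho = random_single_photon_state(L, rng)
    for r in range(1, L):
        dm = pair_distribution_M(rho, r)
        dp = pair_distribution_M_prime(rho, r)
        if dm.keys() != dp.keys():
            return False
        if any(not np.isclose(dm[key], dp[key]) for key in dm):
            return False
    return True


if __name__ == "__main__":
    print(distributions_agree())
```
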
Derivation of secure key rates. Here we give a security proof and derive the
final key rate for the proposed QKD protocol shown in Fig. 2. We first show
that Alice’s random bit sequence $s_1 \ldots s_L$ can be regarded as an
outcome of Z-basis measurement on L qubits. Let $|\Psi\rangle$ be the state of
an L-pulse train emitted from the laser source. In the actual setup, Alice
chooses $s_1 \ldots s_L$ randomly and applies phase shifts accordingly,
resulting in the emitted state
$\bigotimes_k (-1)^{s_k \hat{n}_k}|\Psi\rangle$, where
$\hat{n}_k := \hat{a}_k^{\dagger}\hat{a}_k$ is the photon-number operator for
the kth pulse. Instead, she could prepare L qubits and the L pulses in an
entangled state

$$2^{-L/2} \bigotimes_{k=1}^{L} \left(\sum_{s_k = 0,1} |s_k\rangle_k\, (-1)^{s_k \hat{n}_k}\right) |\Psi\rangle \qquad (8)$$

where $\{|0\rangle_k, |1\rangle_k\}$ are the Z-basis states of the kth qubit.
The states of the L pulses are identical to those in the actual setup, and if
Alice needs bit value $s_k$, she may simply measure the kth qubit on the Z
basis.

It is useful for later discussion to ask what happens if Alice measures the L
qubits in the X basis $\{|+\rangle, |-\rangle\}$ with
$|\pm\rangle := 2^{-1/2}(|0\rangle \pm |1\rangle)$. Let $n_-$ be the number of
qubits found in state $|-\rangle$. The statistics of $n_-$ is related to the
photon number distribution in $|\Psi\rangle$. In fact, it is seen from
equation (8) that if the kth pulse contains an even number of photons, the
state of the kth qubit is $|+\rangle_k$, and if the number is odd, the state
is $|-\rangle_k$. Hence $n_-$ is no larger than the total photon number. The
argument so far holds for any pure state $|\Psi\rangle$, so it is also true
when the source emits a mixed state. We thus conclude that, if the source
fulfils equation (3),

$$\Pr(n_- > \nu_{\mathrm{th}}) \le e_{\mathrm{src}} \qquad (9)$$


Next, we relate Bob’s apparatus in Fig. 2 to measurement M. We assume that the
detectors can discriminate a single photon from two or more photons,
and that dark counts and inefficiency can be equivalently ascribed to a
property of the transmission channel. Bob declares successful detection when a
photon is detected from a superposed pulse and no other detection occurs in
the whole pulse train. This ensures that the detected signal comes from a
single-photon state. When the delay is r′T, the measurement is characterized
similarly to equation (6) by operators

$$\hat{F}^{(r')}_{k',s} := \frac{1}{2} P\!\left(\frac{|k'\rangle + (-1)^s |k' + r'\rangle}{\sqrt{2}}\right) \qquad (10)$$

except that it is defined only if $1 \le k' \le L - r'$. To see that this is
equivalent to measurement M except at an efficiency of 1/2, introduce an
auxiliary random bit c to define r = r′ and k = k′ if c = 0, while
r = L − r′ and k = k′ + r′ if c = 1. Then r is uniformly random. Given r, the
probability of outcome (k, s) for input state $\rho$ is written as

$$\Pr(k,s) = \frac{1}{2}\mathrm{Tr}\!\left(\rho \hat{F}^{(r)}_{k,s}\right) + \frac{1}{2}\mathrm{Tr}\!\left(\rho \hat{F}^{(L-r)}_{k+r-L,\,s}\right) \qquad (11)$$

where it is understood that $\hat{F}^{(r')}_{k',s} = 0$ for
$k' \ge L - r' + 1$ or $k' \le 0$. It turns out that one of the terms always
vanishes and $\Pr(k,s) = \mathrm{Tr}(\rho \hat{E}^{(r)}_{k,s})/2$. We thus
conclude that Bob’s apparatus is equivalent to measurement M preceded by a
filter that allows only single-photon states to pass through with efficiency
1/2.

To assess how much Eve knows about Alice’s key bit $s_A$, we may assume that
Bob carries out measurement M′. Learning i, b and r, Alice applies a
controlled-NOT operation to the qubit i as control and qubit
$j = i + (-1)^b r \pmod{L}$ as target. The key bit $s_A = s_i \oplus s_j$ in
the original protocol is now equivalent to the outcome of Z-basis measurement
on qubit j. If one measures this qubit in X basis instead, the probability of
a ‘phase error’, namely, of finding it in the $|-\rangle$ state, is no more
than $e_{\mathrm{ph}}$, defined by

$$e_{\mathrm{ph}} := \frac{e_{\mathrm{src}}}{Q} + \left(1 - \frac{e_{\mathrm{src}}}{Q}\right)\frac{\nu_{\mathrm{th}}}{L-1} \qquad (12)$$

This is because the controlled-NOT operation does not affect the X eigenstates
of the target, the index j is chosen uniformly from all qubits except the ith
via random number r, and finally equation (9) ensures that among the N rounds
contributing to the sifted key, at least $N - N_{\mathrm{em}} e_{\mathrm{src}}$
rounds satisfy $n_- \le \nu_{\mathrm{th}}$ in the limit of large N. Then, if
$e_{\mathrm{ph}} < 1/2$, she can extract a secure final key of length
$N[1 - h(e_{\mathrm{ph}})]$ by privacy amplification. Bob composes his sifted
key from bit $s_B$ in each round. The error reconciliation will be achieved by
letting Alice send $N h(e_{\mathrm{bit}})$ bits of encrypted information to
Bob such that he can reconcile his sifted key to Alice’s. The net production
length is then given by

$$G = N[1 - h(e_{\mathrm{bit}}) - h(e_{\mathrm{ph}})] \qquad (13)$$


This rate can be improved by applying a common random optical phase shift δ to
all the L pulses in the actual protocol. This makes the emitted quantum state
of the train into a classical mixture of states with fixed total photon
numbers, enabling Alice to tag each of the rounds with
$\nu > \nu_{\mathrm{th}}$ in principle, though she need not do so in practice.
We may then assume that Eve completely knows the sifted key bits for the
tagged portion (at most $N e_{\mathrm{src}}/Q$ bits), while the rest is treated
as if $e_{\mathrm{src}} = 0$. This leads to equation (4) in the main text and
is used in Fig. 3.

If we omit the random optical phase shift δ, the rate for L = 128 and
$e_{\mathrm{bit}} = 0.03$ decreases by about 10% from the rate shown in Fig. 3.
On the other hand, if we are allowed to assume that the emitted photon number
obeys a Poissonian distribution, the rate for L = 128 and
$e_{\mathrm{bit}} = 0.03$ is larger than the one shown in Fig. 3 by
about 30% even if we omit the random optical phase shift.

Finally, we briefly discuss the behaviour of our protocol for a finite-sized
key. Let $f(k; n, p) := \sum_{j>k} p^j (1-p)^{n-j}\, n!/[j!(n-j)!]$ be the
tail distribution for finding more than k successful events in a binomial
distribution. Except for a probability
$\epsilon_1 := f(N r_1; N_{\mathrm{em}}, e_{\mathrm{src}})$, we may choose
$N r_1$ bits among the N sifted key bits to include all the tagged portion. We
make no assumption about the phase errors for the chosen $N r_1$ bits. If we
count the number of phase errors for the remaining $N' := N(1 - r_1)$ bits, it
should be no larger than $N' r_2$ except for a probability
$\epsilon_2 := f(N' r_2; N', \nu_{\mathrm{th}}/(L-1))$. The imperfection in
the final key is characterized through the failure probability $\epsilon$ in
identifying the phase error pattern when $N H_{\mathrm{PA}}$ bits of error
syndrome are given. Given s > 0, we choose $r_1$ and $r_2$ to satisfy
$\epsilon_1 = \epsilon_2 = 2^{-s}$. Then, we have
$\epsilon \le 3 \times 2^{-s}$ for
$H_{\mathrm{PA}} = r_1 + (1 - r_1)h(r_2) + s/N$. The commonly assumed quality
of the key corresponds to about s = 70 to 160.

As an example, consider the case with L = 128, $\eta = 2 \times 10^{-3}$ and
$e_{\mathrm{bit}} = 0.03$. The results below do not change significantly if
$\eta$ is chosen to be smaller, such as $\eta$ = 10°. Asymptotically, the rate
is optimized when $\mu = 0.0541$ with $\nu_{\mathrm{th}} = 17$, leading to
$p_1 := e_{\mathrm{src}} N_{\mathrm{em}}/N = 0.047$,
$p_2 := \nu_{\mathrm{th}}/(L - 1) = 0.134$, and
$H_{\mathrm{PA}} = H_{\mathrm{PA}}^{\infty} := p_1 + (1 - p_1)h(p_2) = 0.588$.
For a finite length N, let us first use a crude Gaussian approximation of
$\ln[f(k; n, p)] \cong -(k - np)^2/[2np(1 - p)]$. This leads to
$r_1 = p_1 + \sqrt{(2\ln 2)\,p_1\,(s/N)}$ and
$r_2 = p_2 + \sqrt{(2\ln 2)\,p_2(1 - p_2)\,(s/N')}$. For $N \gg s$,
substituting numerics gives
$H_{\mathrm{PA}} = H_{\mathrm{PA}}^{\infty}\,(1 + 1.98\sqrt{s/N})$. A better
estimate is given by a rigorous bound
$\log_2 f(k; n, p) \le -n D(k/n \| p)$ with
$D(q\|p) := q\log_2(q/p) + (1 - q)\log_2[(1 - q)/(1 - p)]$. We calculated the
finite-size rate after optimizing over $\mu$ and $\nu_{\mathrm{th}}$ for each
N, and then derived its fraction R to the asymptotic optimal rate of
$1.16 \times 10^{-5}$. For s = 100, we found R = 3.3% for $N = 10^3$,
R = 53% for $N = 10^4$, R = 84% for $N = 10^5$, and R = 95% for $N = 10^6$.
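
The asymptotic figures in the example above can be recomputed directly. The sketch below is a hedged illustration, not the authors' code: it assumes Poissonian photon statistics with mean Lμ per train, the relations for Q and the source tail probability given in the Fig. 3 legend, and equation (4) for the key length; all function names are ours.

```python
import math


def binary_entropy(x: float) -> float:
    """Binary Shannon entropy h(x) in bits, with h(0) = h(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1.0 - x) * math.log2(1.0 - x)


def poisson_tail(lam: float, nu_th: int) -> float:
    """Pr(nu > nu_th) for a Poisson-distributed photon number with mean lam."""
    term = math.exp(-lam)
    cdf = term
    for nu in range(1, nu_th + 1):
        term *= lam / nu
        cdf += term
    return 1.0 - cdf


def asymptotic_example(L=128, eta=2e-3, mu=0.0541, nu_th=17, e_bit=0.03):
    """Return (p1, p2, H_PA, key rate per pulse) for the worked example."""
    q = L * eta * mu * math.exp(-L * eta * mu) / 2  # detection rate Q
    e_src = poisson_tail(L * mu, nu_th)             # Pr(nu > nu_th)
    p1 = e_src / q                                  # tagged fraction
    p2 = nu_th / (L - 1)                            # phase-error bound
    h_pa = p1 + (1 - p1) * binary_entropy(p2)
    rate = q * (1 - binary_entropy(e_bit) - h_pa) / L  # G / (L * N_em)
    return p1, p2, h_pa, rate


if __name__ == "__main__":
    p1, p2, h_pa, rate = asymptotic_example()
    print(f"p1 = {p1:.3f}, p2 = {p2:.3f}, H_PA = {h_pa:.3f}, rate = {rate:.3e}")
```

Under these assumptions the sketch gives values close to 0.047, 0.134, 0.588 and 1.16 × 10⁻⁵ per pulse, matching the numbers quoted in the example.
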
Future increases in Arctic precipitation linked to
local evaporation and sea-ice retreat


R. Bintanja & F. M. Selten


Precipitation changes projected for the end of the twenty-first century show
an increase of more than 50 per cent in the Arctic regions. This marked
increase, which is among the highest globally, has previously been attributed
primarily to enhanced poleward moisture transport from lower latitudes. Here
we use state-of-the-art global climate models to show that the projected
increases in Arctic precipitation over the twenty-first century, which peak in
late autumn and winter, are instead due mainly to strongly intensified local
surface evaporation (maximum in winter), and only to a lesser degree due to
enhanced moisture inflow from lower latitudes (maximum in late summer and
autumn). Moreover, we show that the enhanced surface evaporation results
mainly from retreating winter sea ice, signalling an amplified Arctic
hydrological cycle. This demonstrates that increases in Arctic precipitation
are firmly linked to Arctic warming and sea-ice decline. As a result, the
Arctic mean precipitation sensitivity (4.5 per cent increase per degree of
temperature warming) is much larger than the global value (1.6 to 1.9 per cent
per kelvin). The associated seasonally varying increase in Arctic
precipitation is likely to increase river discharge and snowfall over ice
sheets (thereby affecting global sea level), and could even affect global
climate through freshening of the Arctic Ocean and subsequent modulations of
the Atlantic meridional overturning circulation.

Even though the moisture content of the global atmosphere will increase
by roughly 7% per degree warming following the Clausius-Clapeyron
relation, global average precipitation is projected to increase by only
about 2% K⁻¹ in response to global warming. This ‘muted’ precipitation
response is generally attributed to radiative constraints on the
amount of energy available for evaporation. Both observations and
climate model simulations consistently show that Arctic precipitation
trends far exceed this global average, however. Over the course of the
twentieth century, the Arctic experienced an increase in precipitation,
especially in winter, which has largely been attributed to anthropogenic
influences. Observational uncertainties are considerable, however,
owing mainly to the paucity of observational data. According to almost
all climate models, Arctic precipitation will continue to increase and
possibly even accelerate in the twenty-first century, with peak increases
exceeding 50% (Fig. 1) over the Arctic Ocean.
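
The figure of roughly 7% per degree can be made concrete with the Clausius-Clapeyron relation in its simplest form, d(ln e_s)/dT = L_v/(R_v T²). The short sketch below is illustrative only: the constant latent heat of vaporization (2.5 × 10⁶ J kg⁻¹) and the gas constant of water vapour (461.5 J kg⁻¹ K⁻¹) are standard textbook values that we assume here, not quantities taken from the text.

```python
def cc_fractional_rate(temp_k: float,
                       latent_heat: float = 2.5e6,
                       gas_const_vapour: float = 461.5) -> float:
    """Fractional change of saturation vapour pressure per kelvin.

    Implements d(ln e_s)/dT = L_v / (R_v * T^2), assuming a constant
    latent heat of vaporization.
    """
    return latent_heat / (gas_const_vapour * temp_k ** 2)


if __name__ == "__main__":
    for t in (258.0, 273.0, 288.0):
        print(f"T = {t:.0f} K: {100 * cc_fractional_rate(t):.1f}% per K")
```

Near a global mean surface temperature of 288 K this gives about 6.5% per kelvin, in line with the quoted figure, and the fractional rate is larger at colder temperatures.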

The potential consequences of such hefty moisture and precipitation
increases are widespread: (1) more snow will fall onto adjacent continents
and ice sheets, such as the northeastern part of the Greenland ice sheet,
which will modulate ice-sheet growth and sea-level change; (2) the ocean
surface density will drop owing to a net freshening of the Arctic Ocean
(leading, incidentally, to increased stratification that may inhibit
sea-ice melt), affecting the Atlantic meridional overturning circulation
(AMOC) and thereby global climate; (3) river discharge into the Arctic
Ocean will become larger, reinforcing the freshening of the Arctic
Ocean; (4) there will be stronger water vapour and cloud feedbacks,
reinforcing polar warming; (5) more snowfall on sea ice will potentially
affect the ice-ocean albedo feedback and reduce sea-ice growth
by increasing insulation; and (6) there are biological and ecological
implications, for instance, the greater availability of surface moisture for
flora and fauna as Arctic warming causes more precipitation to fall as rain.


The origins of, and mechanisms behind, projected Arctic moistening
are still unclear, however. Is it caused by an intensified local Arctic
hydrological cycle, or does it result from a global hydrological response
to altered evaporation rates and moisture fluxes in lower latitudes?
This is an important issue, for several reasons: (1) if the local response
dominates, Arctic precipitation may be strongly linked to Arctic warming
and sea-ice retreat; (2) in that case the effect on Arctic Ocean freshening
will probably be limited because evaporation and precipitation
effects on surface salinity will then largely cancel out (only a remote
origin will lead to overall net freshening); and (3) the local and remote
contributions probably exhibit a different seasonal imprint, which is
of relevance for all earth system components that depend on the
seasonality of precipitation (such as ecology, ice sheets, hydrology).

Most studies addressing Arctic precipitation have attributed the projected
increase mainly to enhanced poleward moisture transport and
convergence. Global atmospheric warming leads to an increase in
atmospheric moisture according to the Clausius-Clapeyron relation.
This strengthens meridional moisture gradients, indicating that the
moisture transport will increase even if atmospheric dynamics remain
invariant. However, climate warming may additionally lead to a poleward
shift of the subpolar westerly jet stream and to subsequent changes
in storm-track location and strength, which affect the magnitude,
longitudinal distribution and possibly the seasonality of the poleward
atmospheric moisture transport, and thereby the contribution of remote source
regions. Analyses of water isotopes as a proxy for the origin of Arctic
moisture reveal that Arctic precipitation contains a mixture of local
and remote origins whose ratio seems to fluctuate seasonally. The potential
contribution of increasing local evaporation (which peaks in autumn/
winter) to Arctic precipitation trends has been addressed generically,
but as yet a systematic breakdown of local versus remote contributions
is lacking.
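
The Clausius-Clapeyron scaling invoked above can be made concrete with a short calculation. The sketch below is illustrative only and is not part of the analysis reported here; it assumes approximate textbook constants for the latent heat of vaporization and the gas constant of water vapour, and the function name is hypothetical.

```python
# Fractional change of saturation vapour pressure per kelvin from the
# Clausius-Clapeyron relation: d(ln e_s)/dT = L_v / (R_v * T**2).
# Constants are approximate textbook values (assumed, not from the paper).
L_V = 2.5e6    # latent heat of vaporization, J kg^-1
R_V = 461.5    # specific gas constant of water vapour, J kg^-1 K^-1


def cc_sensitivity(temperature_k: float) -> float:
    """Return the relative increase in saturation vapour pressure per K."""
    return L_V / (R_V * temperature_k ** 2)


# Evaluate at the freezing point and at a typical global-mean surface temperature.
for t in (273.15, 288.15):
    print(f"T = {t:.2f} K: {100 * cc_sensitivity(t):.1f}% per K")
```

Under these assumptions the rate falls slowly with temperature and stays within a few tenths of a per cent of the commonly quoted 6-7% per kelvin for near-surface conditions.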

Here we use output from 37 state-of-the-art global climate models 
within the framework of CMIP5 (Coupled Model Intercomparison Project,
phase 5; ref. 5) to systematically quantify projected Arctic precipitation
trends, as well as trends in surface evaporation and poleward moisture 
transport, to assess their relative contributions. We use standardized sim- 
ulations for the period 2006-2100 based on intermediate and strong 
forcing scenarios (see Methods Summary). The multimodel-mean twenty- 
first-century response shows marked relative increases in precipitation 
and surface evaporation that are largely confined to the Arctic Ocean, 
with peak values of up to 65% (precipitation) and 300% (evaporation) 
for the strong forcing scenario (Fig. 1). Moreover, the spatial coherence 
between peak regions in evaporation and precipitation (for example,
in the northern Barents and Kara seas) suggests that surface evapora- 
tion changes, most probably reinforced by sea-ice retreat (Fig. 1c), con- 
tribute considerably to the amplification of future Arctic precipitation. 

The simulated intensification of Arctic (70°-90°N) precipitation varies 
by a factor of 3 to 4 between models (Fig. 2a) (see Methods). Any increase 
in Arctic precipitation has only two possible sources: enhanced local sur- 
face evaporation (local origin) or intensified moisture transport across 
70° N (remote origin). The relative contribution of these sources varies 
considerably between models (Fig. 2a), but on average surface evaporation 


1Royal Netherlands Meteorological Institute (KNMI), Utrechtseweg 297, 3731GA, De Bilt, The Netherlands. 
Figure 1 | Geographical distribution of simulated annual mean
twenty-first-century precipitation, surface evaporation, and sea-ice cover
changes in the Arctic region. Changes (based on the difference between
the means over 2091-2100 and 2006-2015) in precipitation (a), surface
evaporation (b) and sea-ice extent (c), for the strong forcing scenario (RCP8.5).
Precipitation and surface evaporation are presented as relative changes, 
whereas sea-ice cover is depicted as the absolute change in fractional coverage. 
Data represent the average of the CMIP5 climate models (see Methods). 
Simulated geographic patterns are very similar for the intermediate forcing 
scenario (RCP4.5), with maximum values of 27% relative change for 
precipitation, 131% relative change for surface evaporation and 28% fractional 
change for sea-ice cover. 


changes provide the major part of the Arctic precipitation increase 
(61.9 ± 20.5% and 56.4 ± 13.2% of the total for RCP4.5 and RCP8.5,
respectively), even though moisture transport contributes as much as two- 
thirds to Arctic precipitation in the current climate (Methods). Although 
simulations show that moisture transport towards the Arctic will increase 
over the twenty-first century, the relative importance of moisture trans- 
port to Arctic precipitation will decline with climate warming. Precipi- 
tation minus evaporation will become larger, leading to net Arctic Ocean 
freshening (Methods), but much less than previously conjectured.


480 | NATURE | VOL 509 | 22 MAY 2014 


[Figure 2 plot. Panel a: contribution to precipitation increase (km³ yr⁻¹) for each model. Panel b: contribution to precipitation increase (km³ yr⁻¹) for each month, January to December. Main panels show RCP8.5; insets show RCP4.5. Legend: Transport (remote origin); Evaporation unrelated to sea ice (local origin); Evaporation due to sea-ice retreat (local origin).]

Figure 2 | Simulated annual and monthly twenty-first-century changes in 
Arctic mean precipitation, poleward moisture transport across 70° N 
(remote origin) and surface evaporation components (local origin). 

a, Annual mean changes, where each bar denotes one CMIP5 model sorted 
according to the magnitude of the simulated precipitation change (horizontal 
black lines represent 50% of each column). b, Seasonal changes, where each 
bar represents the monthly and multimodel mean. Changes are given for the 
strong (RCP8.5) and intermediate (insets, RCP4.5) forcing scenarios. The total 
Arctic surface evaporation (the sum of the red and orange bars) is separated 
into its ice-retreat (red) and non-ice-retreat (orange) components. The 
attribution of sea-ice retreat to the total surface evaporation was evaluated 
by calculating monthly evaporative flux differences over only the ice-retreat 
regions, which were then summed over the respective 10-year periods 

(see Methods Summary). 


Sea ice is an important contributor to the enhanced surface evaporation
rates, especially in late autumn and winter (Fig. 2b). When sea
ice retreats, the surface consists of open water at the freezing point instead
of ice at temperatures far below zero, leading to enormous increases in
surface evaporation in the ice-retreat regions (Fig. 1b). As a result, a
significant proportion of the total Arctic surface evaporative change
can be attributed to sea-ice retreat (40.9 ± 11.5% and 68.7 ± 11.7% for
the intermediate and strong forcing scenarios, respectively), especially
during the cold winter months (Fig. 2b). Evidently, sea-ice retreat dominates
the total evaporative response in the strong forcing scenario, which
can be attributed to the projected strong retreat of sea ice (−48.7 ± 17.5%
relative change in sea-ice area) compared to the intermediate scenario
(−21.1 ± 12.9%). Changes in moisture transport peak in summer and
autumn (Fig. 2b), when the increases in meridional temperature and
moisture gradients are at their maximum (Arctic temperature changes are
minimal in summer). Arctic precipitation increases peak in late autumn
and winter at more than twice the rate of late spring and summer in
both scenarios, with strongly seasonally varying contributions from
local and remote sources that seem to be consistent with water isotope
analyses.

Sea-ice retreat is obviously related to Arctic warming, which peaks in
winter, through the ice-ocean albedo feedback and the seasonal ocean heat
storage/release, as well as through other ice-related climate feedbacks.
Moreover, increased atmospheric moisture contributes to Arctic warming
through the water vapour and cloud feedbacks, and as such constitutes
a positive feedback that produces more evaporation through ice
retreat. Because models with strong Arctic warming generally exhibit
considerable sea-ice retreat (Fig. 3), there is a solid intermodel relation
between Arctic warming and changes in surface evaporation (both
total and sea-ice-related), in both forcing scenarios (Fig. 3a and b). In
contrast, we find only a minor and non-significant intermodel increase
in meridional moisture transport changes as a function of Arctic warming
(Fig. 3c). This is probably due to counteracting effects such as a smaller
meridional temperature gradient associated with Arctic tropospheric
warming and a consequent reduction in synoptic activity. Moreover,
seasonality may affect this relation because Arctic warming peaks in
winter whereas transport changes are largest in summer and early autumn
(Fig. 2b). In any case, the dominance of ice retreat in Arctic evaporation
increases leads to total precipitation changes being strongly linked to
Arctic warming and its associated sea-ice retreat (Fig. 3d), which all
peak in late autumn/winter (ref. 29 and Fig. 2b). Models that best simulate
the observed sea-ice retreat exhibit above-average twenty-first-century
changes in precipitation and evaporation (Fig. 3), suggesting
that the best estimate for these changes would be even higher than the
multimodel average. No intermodel relation is found between present-day
moisture fluxes and their future changes. Hence, simulated changes
in Arctic moisture budget can primarily be attributed to climate feedbacks
related to Arctic warming and sea-ice retreat, and only to a lesser
degree to the mechanisms behind the intermodel present-day spread
in those components.
Enhanced surface evaporation induced by sea-ice retreat thus plays
an important part in projected increases in Arctic precipitation. The
intensifying local Arctic hydrological cycle significantly amplifies the
sensitivity of Arctic mean precipitation (ΔP/(PΔT) = 4.5% K⁻¹) over
its global mean value (Fig. 4). The primary reason for this amplified
precipitation sensitivity is that the evaporation sensitivity in the Arctic
is much larger than the global mean, with sea-ice retreat more than doubling
its value (Fig. 4). (The non-sea-ice-related evaporation sensitivity
ΔE/(EΔT) is already larger than its global mean owing mainly to the
reduced vertical stability of the lower troposphere, especially in the
non-summer months.) Simulated atmospheric moisture transport sensitivities
are relatively small (~3% K⁻¹) compared to the Clausius-Clapeyron
relation (6-7% K⁻¹), which might indicate that subdued atmospheric
dynamics and diminished meridional temperature gradients effectively
reduce the moisture transport sensitivity.
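
The sensitivities discussed above are relative changes per degree of Arctic warming, ΔX/(XΔT). A minimal sketch of that definition follows; the function name and the input numbers are hypothetical placeholders chosen for illustration and are not values taken from the model ensemble.

```python
def sensitivity_percent_per_k(present: float, future: float, delta_t: float) -> float:
    """Relative change of a flux per kelvin of warming, in % K^-1.

    Implements 100 * (future - present) / (present * delta_t).
    """
    if present == 0 or delta_t == 0:
        raise ValueError("present value and warming must be non-zero")
    return 100.0 * (future - present) / (present * delta_t)


# Hypothetical example: a flux rising from 100 to 145 units under 10 K of warming.
example = sensitivity_percent_per_k(100.0, 145.0, 10.0)
print(f"{example:.1f}% per K")
```

For the two evaporation components, the same expression would be applied with total evaporation as the present-day denominator, following the definition given with Fig. 4.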

Projected twenty-first-century increases in Arctic precipitation rates 
of over 50% are among the highest globally, a consistent feature among 
state-of-the-art global climate models. Precipitation changes peak in 
late autumn and winter, with the local and remote contributions exhib- 
iting a pronounced seasonal cycle. Arctic warming and sea-ice retreat 
intensify the local hydrological cycle within the Arctic region through 
increasing the amount of open water and amplifying surface evaporation 
(Fig. 3b), especially in winter, causing strongly increased precipitation 
rates, in contrast to earlier findings. The Arctic mean precipitation
sensitivity (relative change per degree of warming) of about 4.5% K⁻¹ is
much larger than the global mean sensitivity of 1.6-1.9% K⁻¹, owing
mainly to Arctic warming and sea-ice retreat. Evidently, the local hydrol- 
ogical cycle intensifies at the expense of the large-scale hydrological cycle 
in increasing Arctic precipitation rates. 

The associated breakdown between local (~60%, peaking in late 
autumn and winter) and remote (~40%, peaking in summer and early 
autumn) origins of Arctic precipitation changes is important for under- 
standing future Arctic Ocean surface salinity trends (Methods); this is 
because only precipitation from remote origins leads to net freshening 
Figure 4 | Simulated twenty-first-century Arctic mean precipitation,
evaporation and moisture transport sensitivities. The sensitivity in a variable
X, where X is either precipitation (P), evaporation (E) or moisture transport (F),
is defined as ΔX/(XΔT), in which T is the Arctic mean temperature (note that
for the evaporation components the denominator uses total evaporation).
Crosses denote individual models. Black open squares indicate model-mean
values, and vertical black error bars represent the associated intermodel
standard deviations. The red and blue horizontal lines show the model-mean
and global-mean precipitation sensitivities (which equal the global-mean
evaporation sensitivity), with the surrounding band denoting the associated
intermodel standard deviations. Results are given for the intermediate
(red, n = 33) and strong (blue, n = 37) forcing scenarios. The black
horizontal line represents the Clausius-Clapeyron relation.

of the Arctic Ocean (in conjunction with increased river discharge).
This, in turn, will affect the Atlantic ocean circulation (Methods) and
thereby global climate, and counteract sea-ice melt. The seasonal imprint
of precipitation increases affects snow accumulation on sea ice (affecting
its seasonal growth and decay) as well as on ice sheets such as northeastern
Greenland (Fig. 1a) and adjacent continental areas (because
the peak increase occurs during winter, most additional precipitation
will fall as snow), with possible modulations of global sea level and
seasonal snow albedo feedbacks.


METHODS SUMMARY 


In all analyses we used the Coupled Model Intercomparison Project, phase 5 (CMIP5)
state-of-the-art global climate models (ref. 5), which were applied in a series of
standardized forcing scenarios for the period 2006-2100. Here we use the intermediate
(RCP4.5) and strong (RCP8.5) forcing scenarios, for which the combined greenhouse,
aerosol and other radiative forcings in the year 2100 total 4.5 W m⁻² and
8.5 W m⁻², respectively. We use all models for which data coverage was complete
and without obvious errors (other than that no selection of models was made); one
ensemble member per model (the first) was used. Twenty-first-century trends in
the Arctic moisture budget terms are defined here as the difference between the
means over the periods 2091-2100 and 2006-2015, with all uncertainties throughout
the paper being defined as the standard deviations of the model ensemble.
In this study we define the Arctic as the region 70°-90° N because most of the
precipitation and sea-ice changes occur in the Arctic Ocean north of 70° N (Fig. 1). The
moisture budget of the Arctic atmosphere can be expressed as ∂Q/∂t = F − (P − E),
where Q is the total moisture content of the atmosphere (the sum of all atmospheric
water vapour, water and ice), P is the total Arctic precipitation (downward), E is the
total Arctic surface evaporation (upward) and F is the poleward atmospheric moisture
transport across 70° N. The Arctic atmosphere moisture reservoir is quite small;
the mean residence time of atmospheric moisture is less than one week, meaning that
∂Q/∂t ≪ F, P, E. We thus evaluate the poleward transport using the balance: F = P − E.
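
The steps described in this summary can be sketched as follows. The per-model numbers are invented placeholders, not CMIP5 output, and the helper names are hypothetical; only the procedure (differences between decadal means, transport taken as the residual F = P − E, ensemble mean and standard deviation) follows the text. The use of the sample standard deviation is an assumption.

```python
from statistics import mean, stdev

# Placeholder decadal means (km^3 yr^-1) for three imaginary models:
# (P_present, E_present, P_future, E_future). These are NOT real CMIP5 data.
models = {
    "model_a": (5000.0, 1800.0, 6800.0, 2900.0),
    "model_b": (5300.0, 1900.0, 7400.0, 3000.0),
    "model_c": (5100.0, 1700.0, 7000.0, 2700.0),
}


def budget_changes(p0, e0, p1, e1):
    """Return changes (dP, dE, dF), with transport taken as F = P - E."""
    f0, f1 = p0 - e0, p1 - e1
    return p1 - p0, e1 - e0, f1 - f0


changes = [budget_changes(*vals) for vals in models.values()]

# Share of each model's precipitation increase supplied by local evaporation (%).
local_share = [100.0 * d_e / d_p for d_p, d_e, _ in changes]

# Ensemble mean and (sample) standard deviation across the models.
print(f"local share: {mean(local_share):.1f} +/- {stdev(local_share):.1f} %")
```

Because F is diagnosed as a residual, the local and remote contributions sum to the precipitation change by construction in each model.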


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS


Schematic of main results. The main conclusions of this paper have been sum- 
marized in Extended Data Fig. 1. It schematically depicts the twenty-first-century 
changes in the simulated hydrological cycle of the Arctic region, as well as the most 
important climatological consequences of these changes. 
Intermodel differences in moisture budget components. Here we present cur- 
rent and future CMIP5 (ref. 5) model-mean values of total Arctic precipitation, 
surface evaporation and the poleward moisture transport across 70° N for the 
intermediate forcing scenario RCP4.5 (Extended Data Table 1) and the strong 
forcing scenario RCP8.5 (Extended Data Table 2). We note that while the changes 
in local surface evaporation dominate the changes in Arctic precipitation, mois- 
ture transport is the largest term at present and this remains the case in the future, 
both for RCP4.5 and RCP8.5. To be able to compare the simulated present-day
moisture budget of the Arctic with those evaluated from reanalyses data, we also
calculated the moisture budget components for the Arctic defined as 60°-90° N. In
the CMIP5 model ensemble (RCP4.5), present-day annual and Arctic mean
(60°-90° N) moisture fluxes are: P = 17,368 ± 1,911 km³ yr⁻¹, E = 7,961 ± 1,133 km³ yr⁻¹
and F = 9,407 ± 1,989 km³ yr⁻¹. These are very close to present-day values evaluated
from reanalyses data: P = 17,408 km³ yr⁻¹, E = 8,073 km³ yr⁻¹ and F = 8,830 km³ yr⁻¹.
Extended Data Fig. 2 shows distributions of modelled precipitation, evaporation 
and moisture transport across 60° N. Evidently, even though the multimodel aver- 
ages are quite close to the reanalyses-derived values, there is substantial spread among 
models concerning the Arctic moisture budget components. The figures also depict 
the distributions for the future scenarios (RCP4.5 and RCP8.5), which exhibit an 
even larger spread. Interestingly, though, the intermodel spread in the present-day 
moisture budget components seems to have little bearing on the simulated twenty- 
first-century changes in these components (and the spread therein) because we 
find no appreciable intermodel correlations between the present-day values and 
their changes. Instead, the changes in moisture budget components are governed 
by Arctic warming and sea-ice retreat (see Fig. 3). Hence, we conclude that the 
simulated changes in Arctic moisture budget can primarily be attributed to climate 
feedbacks related to Arctic warming and sea-ice retreat, and only to a lesser degree 
to the mechanisms behind the intermodel present-day spread in those components. 
Effects of Arctic precipitation changes on sea surface salinity. A potentially far- 
reaching consequence of increased precipitation in the Arctic region is its effect on 
Arctic Ocean salinity, primarily because of its potential relation to (global) ocean
currents (see next section). Extended Data Fig. 3 depicts simulated twenty-first-century
changes in average (70°-90° N) Arctic Ocean sea surface salinity S_sea-surface in the
CMIP5 models for the intermediate (RCP4.5) and strong (RCP8.5) forcing
scenarios, respectively. With a few exceptions in the intermediate RCP4.5 scenario, all
models exhibit an overall decrease in S_sea-surface, consistent with the simulated increases
in Arctic precipitation. However, a number of other climate mechanisms also
potentially affect Arctic Ocean S_sea-surface: local evaporation, increased river discharge and
runoff, sea-ice melt and altered ocean advection and mixing. Hence, Arctic Ocean
freshening cannot be straightforwardly attributed to increases in precipitation.
To quantify the effect of Arctic precipitation increase on Arctic sea surface salinity 
we carried out additional climate model simulations using the global coupled climate 
model EC-Earth (ref. 30), which is part of the CMIP5 model ensemble. For this purpose we
use exactly the same model settings as in the CMIP5 simulations that were carried 
out with EC-Earth. Two 44-year simulations were done for perpetual year 2006 
forcing conditions: a control simulation, and a simulation in which the precip- 
itation over the Arctic Ocean north of 70°N is artificially increased by 50%. Even 
though the climate will drift somewhat because of the applied artificial increase in 
precipitation (for instance, sea ice is found to expand), the difference between these 
two simulations will at least provide a rough estimate of how much increased
precipitation contributes to changes in Arctic Ocean S_sea-surface. Extended Data Fig. 4
shows the simulated difference in the geographical distribution of S_sea-surface. Although
there is a general decrease in S_sea-surface over most of the Arctic Ocean (as well as in
the North Atlantic), there is a region where S_sea-surface exhibits an increase. The
simulation shows an overall increase in sea ice (not shown), which may be due to the
positive feedback of overall freshening on sea ice formation, or to internal climate
variability. In regions with strong sea-ice expansion the associated brine rejection
is apparently able to counteract the overall S_sea-surface decrease caused directly by
increased precipitation (simulations with larger artificial precipitation increases
exhibit a decrease in S_sea-surface throughout the entire Arctic Ocean). With a few
simplifying assumptions (for example, that changes in S_sea-surface scale linearly with
those in the applied precipitation increase and that only remote moisture leads to
net freshening) we estimate the contribution of increases in Arctic precipitation to
Arctic Ocean surface freshening at about 25% for both the RCP4.5 and RCP8.5
scenarios. Hence, according to these simulations, about a quarter of the decrease
in S_sea-surface can be attributed to increased precipitation in the Arctic.

To further assess the effect of local versus remote moisture sources on Arctic
Ocean S_sea-surface we show in Extended Data Fig. 5 the intermodel relation between
the ratio of the changes in moisture transport across 70° N and in surface
evaporation (that is, the ratio of remote and local contributions), and simulated S_sea-surface
changes. The correlation is small (−0.63), presumably because of other
mechanisms also affecting Arctic Ocean S_sea-surface, as mentioned above. Nevertheless, the
results suggest that models with a relatively large increase in moisture transport
(remote sources) exhibit relatively strong reductions in S_sea-surface, whereas models
with strong Arctic evaporation increase (local sources) simulate a relatively small
decrease in S_sea-surface. Hence, moisture originating from remote sources has the
largest effect on mean Arctic Ocean S_sea-surface; to understand changes in Arctic Ocean
S_sea-surface it is therefore important to know whether the increase in Arctic
precipitation is due to moisture transport from remote sources or to local surface evaporation.
Effects of Arctic precipitation changes on the AMOC. The CMIP5 climate runs
(RCP4.5, RCP8.5) exhibit a significant reduction in AMOC in almost all climate
models, which is generally attributed to reduced surface ocean density in the
deep-water formation regions in the North Atlantic (either by an increase in sea surface
temperatures or by a decrease in S_sea-surface). Increases in Arctic precipitation and
surface freshening of the Arctic Ocean and the associated outflow of this relatively
fresh and light surface water towards the North Atlantic will potentially contribute
to the AMOC decrease. Here we investigate this contribution through a series of
44-year simulations using the global climate model EC-Earth (ref. 30), in which we
artificially altered the precipitation rates over the Arctic Ocean north of 70° N, similar
to the simulations described in the previous section. Arctic precipitation rate changes 
of -50%, +50%, +100% and +300% were applied in separate simulations, in 
addition to a control run without any change. In the previous section we showed 
that, as expected, increases in Arctic precipitation lead to an average surface fresh- 
ening in the Arctic Ocean, and also in the North Atlantic Ocean (Extended Data 
Fig. 4). 

In Extended Data Fig. 6 we show the temporal variation in annual mean AMOC 
strength at 30° N (definition used in ref. 2) for the various Arctic precipitation 
perturbation simulations. Evidently, increases in Arctic precipitation suppress the 
AMOC strength through exporting relatively fresh water to the North Atlantic, 
causing a reduced ocean surface salinity and density in the deep-convection zones. 
Averaged over the final 20 years, the control AMOC strength at 30° N equals 12.8
± 0.4 Sverdrup (Sv), whereas in the +50% Arctic Ocean precipitation simulation
this value is 11.4 ± 0.6 Sv (the uncertainty represents the standard deviation based
on interannual variations in AMOC strength). Hence, according to these simulations,
a 50% increase in Arctic precipitation leads to a significant decrease in AMOC
strength of about 10%. Given that the model-average AMOC decrease over the
twenty-first century in the CMIP5 model ensemble for the RCP8.5 scenario is about
40% (ref. 2), and in EC-Earth only about 25% (owing mainly to its comparatively
weak AMOC in the present-day climate), the projected increase in Arctic precipitation
and the associated outflow of fresh surface water towards the North Atlantic
contribute substantially to the projected decrease in the AMOC over the
twenty-first century.
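
The relative weakening quoted above follows directly from the two 20-year means given in the text. As a quick arithmetic check (the variable names are illustrative):

```python
control_sv = 12.8      # control AMOC strength at 30 N, Sv (from the text)
perturbed_sv = 11.4    # +50% Arctic precipitation run, Sv (from the text)

# Relative decrease of the perturbed run with respect to the control, in %.
relative_decrease = 100.0 * (control_sv - perturbed_sv) / control_sv
print(f"AMOC decrease: {relative_decrease:.1f}%")
```

The absolute difference of 1.4 Sv exceeds either of the quoted interannual standard deviations (0.4 and 0.6 Sv).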


30. Hazeleger, W. et al. EC-Earth: a seamless Earth system prediction approach in 
action. Bull. Am. Meteorol. Soc. 91, 1357-1363 (2010). 
Extended Data Figure 1 | Schematic overview of the main processes
discussed in the paper. Blue arrows denote future changes in the
various moisture fluxes, with the size of the arrows loosely depicting the
magnitude of the changes. The red text and arrows represent the associated
Arctic climatic consequences.


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


[Extended Data Figure 2 plot. Three histograms of the number of CMIP5 models against: Arctic precipitation (10³ km³ yr⁻¹); Arctic evaporation (10³ km³ yr⁻¹); moisture transport across 60° N (10³ km³ yr⁻¹). Legend: present-day, RCP4.5, RCP8.5.]


Extended Data Figure 2 | Number of CMIP5 models that simulate specific
amounts of precipitation, surface evaporation and moisture transport
across 60° N. Distributions of Arctic (60°-90° N) precipitation (a), surface
evaporation (b) and poleward moisture transport across 60° N (c) in terms of
the number of CMIP5 models that simulate a specific amount. Distributions are
given for the present (green), future RCP4.5 (red) and future RCP8.5 (blue).
Dashed lines denote the model-mean values, while the purple line shows the
present-day value derived from reanalyses data.
Extended Data Figure 3 | Simulated twenty-first-century changes in sea
surface salinity in the Arctic Ocean north of 70° N. Results are shown for the
intermediate forcing scenario (RCP4.5) (a) and the strong forcing (RCP8.5)
(b), sorted according to the magnitude of the change. The dashed line shows the
multimodel-mean value. The salinity is shown on the colour scale in practical
salinity units (PSU).
Extended Data Figure 4 | Simulated changes in sea surface salinity resulting
from a 50% increase in precipitation over the Arctic Ocean north of 70° N
using the global climate model EC-Earth (ref. 30). The salinity change is shown on
the colour scale in PSU.
Extended Data Figure 5 | Simulated change in Arctic mean sea surface
salinity against the ratio of changes in moisture transport across 70° N and
surface evaporation. The ratio is equivalent to the ratio of remote and local
contributions. Each square represents one CMIP5 model. The straight line
shows the best linear fit to the data, where R is the correlation coefficient and p is
the probability that uncorrelated data of the same sample size would yield a
similar relationship.
Extended Data Figure 6 | Simulated AMOC strength as a function of Arctic
precipitation increase. Time series of simulated annual-mean AMOC
strength at 30° N for the various Arctic precipitation (P) change simulations
(see main text for details) carried out with the global climate model EC-Earth (ref. 30).
Extended Data Table 1 | Arctic moisture flux components and their changes for the RCP4.5 scenario

                 Present-day    Future         Change (abs.)   Change (%)
Precipitation    5162 ± 784     5990 ± 833     827 ± 478       +16
Evaporation      1857 ± 417     2389 ± 651     532 ± 382       +29
Transport        3305 ± 788     3678 ± 756     296 ± 182       +9

Total Arctic (70°-90° N) moisture flux components for the present (2006-2015) and the future (2091-2100) based on the RCP4.5 forcing scenario, as well as the absolute (abs.) and relative twenty-first-century
changes. Values are based on the CMIP5 model ensemble (see Methods Summary), with the uncertainties representing the standard deviations of the multimodel ensemble. Units are km³ yr⁻¹.
Extended Data Table 2 | Arctic moisture flux components and their changes for the RCP8.5 scenario

                 Present-day    Future         Change (abs.)   Change (%)
Precipitation    5175 ± 710     7118 ± 1001    1943 ± 600      +38
Evaporation      1840 ± 418     3042 ± 718     1118 ± 480      +61
Transport        3335 ± 736     4160 ± 855     825 ± 277       +25

Total Arctic (70°-90° N) moisture flux components for the present (2006-2015) and the future (2091-2100) based on the RCP8.5 forcing scenario, as well as the absolute (abs.) and relative twenty-first-century
changes. Values are based on the CMIP5 model ensemble (see Methods Summary), with the uncertainties representing the standard deviations of the multimodel ensemble. Units are km³ yr⁻¹.
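
As a consistency check on Extended Data Tables 1 and 2, the local and remote shares of the precipitation increase can be recomputed from the tabulated ensemble-mean changes. This sketch is illustrative; the dictionary layout and function name are assumptions, and a ratio of ensemble means need not equal the ensemble mean of per-model ratios quoted in the main text.

```python
# Ensemble-mean absolute changes (km^3 yr^-1) from Extended Data Tables 1 and 2.
changes = {
    "RCP4.5": {"precipitation": 827.0, "evaporation": 532.0, "transport": 296.0},
    "RCP8.5": {"precipitation": 1943.0, "evaporation": 1118.0, "transport": 825.0},
}


def shares(scenario: str) -> tuple[float, float]:
    """Return (local, remote) shares of the precipitation increase in %."""
    c = changes[scenario]
    local = 100.0 * c["evaporation"] / c["precipitation"]
    remote = 100.0 * c["transport"] / c["precipitation"]
    return local, remote


for name in changes:
    local, remote = shares(name)
    print(f"{name}: local {local:.1f}%, remote {remote:.1f}%")
```

In both scenarios the evaporation and transport changes add up to the precipitation change to within rounding, as expected from the balance F = P − E.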
Uplift and seismicity driven by groundwater
depletion in central California


Colin B. Amos, Pascal Audet, William C. Hammond, Roland Bürgmann, Ingrid A. Johanson & Geoffrey Blewitt


Groundwater use in California’s San Joaquin Valley exceeds replen- 
ishment of the aquifer, leading to substantial diminution of this 
resource and rapid subsidence of the valley floor. The volume
of groundwater lost over the past century and a half also represents a 
substantial reduction in mass and a large-scale unburdening of the 
lithosphere, with significant but unexplored potential impacts on 
crustal deformation and seismicity. Here we use vertical global posi- 
tioning system measurements to show that a broad zone of rock 
uplift of up to 1-3 mm per year surrounds the southern San Joaquin 
Valley. The observed uplift matches well with predicted flexure from 
a simple elastic model of current rates of water-storage loss, most of 
which is caused by groundwater depletion. The height of the adjacent
central Coast Ranges and the Sierra Nevada is strongly seasonal
and peaks during the dry late summer and autumn, out of phase with 
uplift of the valley floor during wetter months. Our results suggest 
that long-term and late-summer flexural uplift of the Coast Ranges 
reduce the effective normal stress resolved on the San Andreas Fault. 
This process brings the fault closer to failure, thereby providing a 
viable mechanism for observed seasonality in microseismicity at 
Parkfield and potentially affecting long-term seismicity rates for
fault systems adjacent to the valley. We also infer that the observed 
contemporary uplift of the southern Sierra Nevada previously attributed
to tectonic or mantle-derived forces is partly a consequence
of human-caused groundwater depletion. 

Hydrospheric mass changes exert direct influence over lithospheric
deformation. Both seasonal and long-term changes to ice, snow or water
loads may induce displacements of the Earth’s surface and can create
stress perturbations that modulate activity on seismogenic faults. A
volume of groundwater approaching 160 km³ in California’s
Central Valley has been lost through pumping, irrigation and
evapotranspiration over the past 150 years or so. Historical and modern
records demonstrate that groundwater depletion occurs primarily in
the drier, hotter, southern portion of the basin (the San Joaquin Valley),
parallel to the central San Andreas Fault and adjacent to the high
topography of the southern Sierra Nevada (Fig. 1). Previous studies
demonstrating sensitivity to small-scale stress changes across this section
of the San Andreas point to seasonal hydrologic or temperature
variations to explain observed changes in seismicity rates or strain in
shallow boreholes.

Modest contemporary uplift rates of the southern Sierra Nevada observed
using space geodesy have been attributed to various tectonic and
geomorphic drivers, such as buoyant response to mantle delamination,
Basin and Range extensional faulting, and erosional mass transfer from
Pleistocene glaciation. Inferred stress change from epeirogenic uplift
in the southern Sierra Nevada is also potentially linked to the transition
from locked to creeping sections of the San Andreas Fault. Here we
seek to explore potential human impacts on contemporary deformation
across this region through global positioning system (GPS) constraints
on lithospheric flexure induced by anthropogenic groundwater level
changes.


Continuous GPS networks spanning the southwestern USA provide
a high-resolution framework for analysing crustal motion in three
dimensions (Fig. 1). Longer available records and recently improved processing
techniques enable determination of reliable GPS vertical velocities with a
precision of less than a millimetre per year, tied to the Earth system’s
centre of mass to within 0.5 mm yr⁻¹ (refs 24 and 25). GPS station data
were processed to create individual time series of vertical position and
were then fitted with an empirical model including epoch position,
velocity, and the amplitude and phase of annual and semiannual
harmonic components (to model seasonal effects). This analysis includes
only stations with at least two-and-a-half years of data and vertical rate
uncertainties of ≤1.0 mm yr⁻¹ (566 stations). Details of the GPS
analysis and processing techniques are included in the Methods.
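
The empirical model described above (epoch position, velocity, and annual plus semiannual harmonics) can be illustrated with a minimal least-squares sketch. This is not the authors' processing code: it omits the equipment-change steps and the damped inversion described in the Methods, it uses synthetic data, and the function names and the 365.25-day year used to convert phase to a day of year are assumptions made for illustration only.

```python
import math

def design_row(t):
    """Design-matrix row for time t (years): offset, rate, annual, semiannual."""
    w = 2.0 * math.pi * t
    return [1.0, t, math.cos(w), math.sin(w), math.cos(2 * w), math.sin(2 * w)]

def solve_linear(a, y):
    """Solve a x = y by Gaussian elimination with partial pivoting."""
    n = len(y)
    m = [row[:] + [rhs] for row, rhs in zip(a, y)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in reversed(range(n)):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x

def fit_seasonal(times, heights, days_per_year=365.25):
    """Undamped least squares via the normal equations (A^T A) p = A^T y."""
    rows = [design_row(t) for t in times]
    k = len(rows[0])
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    aty = [sum(r[i] * h for r, h in zip(rows, heights)) for i in range(k)]
    b, v, c1, s1, c2, s2 = solve_linear(ata, aty)
    phase = math.atan2(s1, c1) % (2.0 * math.pi)  # annual term peaks at 2*pi*t = phase
    return {
        "intercept": b,
        "rate": v,
        "annual_amplitude": math.hypot(c1, s1),
        "annual_peak_day": phase / (2.0 * math.pi) * days_per_year,
    }

# Synthetic daily series (mm): 1.5 mm/yr trend plus an annual cycle of amplitude 5 mm.
times = [i / 365.25 for i in range(4 * 365)]
heights = [2.0 + 1.5 * t
           + 3.0 * math.cos(2 * math.pi * t) + 4.0 * math.sin(2 * math.pi * t)
           for t in times]
fit = fit_seasonal(times, heights)
print(fit)
```

With noise-free synthetic input the fit recovers the prescribed rate and annual amplitude; real station data would also need the outlier screening and step terms described in the Methods.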

Figure 1 shows the distribution of vertical GPS rates across central 
California and east of the Sierra Nevada into the Great Basin. Long-term 
rates exclude GPS stations located within the San Joaquin Valley ground- 
water basin that display comparatively large signals owing to their 
position on soft sediment. These stations are affected by compaction- 
driven subsidence through loss of pore water, poro-elastic deformation
related to groundwater level fluctuations, local irrigation and other
processes unrelated to solid earth motion. 
Figure 1 | Contemporary GPS vertical rates and groundwater decline. Map
of vertical uplift rates from GPS stations (circles) spanning California and
the western Great Basin. Stations in the valley showing anomalously large
signals and local irrigation effects are excluded. Contours show historical
changes in the deep, confined aquifer. The inset depicts the tectonic
configuration of the Central Valley groundwater basin (SV, Sacramento
Valley; SJV, San Joaquin Valley), and the San Andreas Fault (SAF).
Vertical rates across a transect from the central Coast Ranges to the
relatively stable Great Basin range up to about 3 mm yr⁻¹. The highest
average velocities concentrate along the margins of the valley (Fig. 2),
near the locus of the greatest historical changes in the deep confined
aquifer (Fig. 1). The uplift rates of stations west of the San Joaquin Valley
are more variable than those at bedrock sites in the Sierra Nevada,
reflecting a higher proportion of stations on or near agricultural basins
or active faults in the central Coast Ranges. Scatter in the vertical rate
transect probably reflects shifts in the locus of groundwater change in
comparison with historical averages, as well as some aleatory
variability. Nearby earthquakes, such as the San Simeon rupture in 2003 of
moment magnitude 6.5, may also influence stations near the coast with
anomalously high rates near 2 mm yr⁻¹ (Fig. 2). In the Sierra Nevada,
the highest vertical rates occur along the western slope and steadily
decay to the northeast in the Great Basin (Fig. 2). Previous studies show
a similarly poor or inverse correlation between elevation and GPS uplift
rates across the Sierra Nevada. Modelling of postseismic viscoelastic
relaxation following nearby historical earthquakes suggests that such
transient strain accounts for only a fraction of the total observed
vertical signal in the southern Sierra Nevada.

We compare vertical GPS rates with surface uplift predicted from an
elastic half-space model simulating the response to load changes
driven by variations in total water storage (Fig. 2) (see Methods). The model
accounts for surface and subsurface deformation induced by line loads
distributed across a 60-km-wide strip over the surface of an elastic
half-space representing the San Joaquin Valley, the site of long-term,
historical groundwater unloading. Using a range of elastic parameters, we
fitted the GPS-derived vertical velocities with a rate of unloading of
8.8 (±1.3) × 10⁷ N m⁻¹ yr⁻¹ (Fig. 2). We compare this estimate with
the current average unloading rate in the valley based on changes in total
water storage, measured using satellite gravimetry from the Gravity
Recovery and Climate Experiment (GRACE) (ref. 3; see Methods). This
change, measured between October 2003 and March 2010, yields an
average equivalent unloading rate of 7.2 (±2.1) × 10⁷ N m⁻¹ yr⁻¹, slightly
lower but in good overall agreement with the GPS-derived estimate.

Figure 2 | GPS and model comparison. Swath profile of average
contemporary vertical GPS velocity, annual GPS vertical displacement
amplitude, and average topography from the central California Coast Range
to the western Great Basin (SAF, San Andreas Fault; CT, Coalinga thrust).
Vertical velocity and displacement amplitude are shown with 1σ uncertainties.
The profile includes data from 121 stations and encompasses areas of the
greatest historical and current change to groundwater levels (boxed in Fig. 1).
The average GPS velocity is well fitted by an elastic model simulating surface
uplift resulting from the decline in total water storage (including groundwater
loss) centred along and parallel to the San Joaquin Valley. Seasonal changes
in the annual GPS displacement (peak-to-peak amplitude) are distributed
more broadly over the San Joaquin drainage basin, reflecting distribution of
winter precipitation, snow load and reservoirs. Blue bars show the width of
both the long-term and the seasonal loads used in the elastic model.

Assuming that changes in total water storage drive vertical motion 
surrounding the San Joaquin Valley, the GPS data reflect a combina- 
tion of short-term, elastic response to ongoing groundwater depletion 
and longer-term viscous relaxation reflecting the history of hydrospheric 
mass changes. Slight overestimation of the unloading rate based on the 
GPS results leaves open the possibility of either a time-dependent (vis- 
cous) component of the uplift signal, or a (smaller) contribution of on- 
going tectonic uplift. Both the overall match between the elastic model 
and observed GPS uplift (Fig. 2) and seasonal patterns inherent in the 
phase and amplitude of the GPS vertical time series, however, demon- 
strate the importance of the instantaneous (elastic) response to ground- 
water unloading. 

Annual peak uplift for GPS stations in the Coast Ranges and Sierra 
Nevada occurs in the late summer and early autumn (Fig. 3), corresponding
with diminished snow and surface water loads and overlapping with
the end of the summer growing season and peak groundwater pumping 
in the Central Valley (May to September; ref. 2). In contrast, recharge 
during the early spring drives larger-amplitude peak uplift within the 
valley through poro-elastic effects of aquifer water levels (Fig. 3 and 
ref. 2). Seasonal vertical displacements are more broadly distributed 
than average, long-term trends (Fig. 2), reflecting the variability in total 
water storage in upland catchments feeding the San Joaquin Valley. Nota- 
bly, stations along the valley margins move upward annually in accord- 
ance with the Sierra Nevada and Coast Ranges during dry months and 
show a corresponding subsidence during the wet winter and spring (Fig. 3).
These peripheral stations lead to the bimodal distribution for peak uplift 
times observed for the Central Valley (inset to Fig. 3), indicating that 
uplift from hydrospheric load changes dominates even the shortest- 
term signals in the vertical GPS measurements for stations unaffected 
by local irrigation or aquifer effects. 

Flexure in the central Coast Ranges due to seasonal hydrospheric un- 
loading in the San Joaquin Valley provides a viable mechanism to explain 
the annual modulation of seismicity on the San Andreas Fault. Both the 
locked and creeping fault sections at Parkfield (Fig. 1) exhibit an increase 
in the number of earthquakes greater than magnitude 1.25 during the 
late summer and autumn (Fig. 3), previously attributed to local changes 
in effective stress linked to the hydrologic cycle. We explore the potential
seasonal impacts of unloading and uplift of the Coast Ranges on short-
term changes in the fault-normal stress resolved across the fault using the 
elastic half-space model (see Methods). Taking estimates of annual varia- 
tions of 100-300 mm in equivalent water height in the San Joaquin Valley 
from GRACE data and a wider load encompassing the Coast Ranges and
the western Sierra Nevada, the elastic model produces between 3 mm and 
8 mm of maximum annual vertical ground motion, in good agreement 
with the peak-to-peak amplitudes of annual uplift measured by GPS 
(Fig. 2). The corresponding fault-normal stress variations on the San 
Andreas Fault are near 1 kPa at seismogenic depths (Extended Data 
Fig. 4), with peak unclamping during the dry summer and autumn. 
Although we cannot rule out the potential feedback of reduced effective 
normal stress due to diffusion of pore fluids into fault zone rocks during 
recharge, our results suggest that unloading may contribute to seasonal
modulation of seismic activity on the central San Andreas Fault. A similar 
mechanism was invoked to explain annual variations in seismicity in the 
Himalayas. Stress changes due to groundwater unloading are somewhat
higher for other historically active faults closer to the valley such as the 
Coalinga thrust system (Fig. 1), where peak-to-peak annual unloading 
gives rise to positive Coulomb stress changes of 1.0 kPa to 1.7 kPa (see 
Methods and Extended Data Fig. 4). 

Given that relatively small annual stress perturbations (around 1 kPa)
appear sufficient to modulate earthquakes along the central San Andreas
Fault, we use the elastic model to estimate the stress rate caused by
ongoing human-induced groundwater removal (see Methods). Taking
an estimate of the unloading rate due to loss in water storage in the San
Joaquin Valley of 8.8 (±1.3) × 10⁷ N m⁻¹ yr⁻¹, we calculate a rate of
unclamping of 0.07-0.18 kPa yr⁻¹ for the San Andreas Fault. Coulomb
failure stress rates on the Coalinga thrusts are higher, averaging
0.13-0.57 kPa yr⁻¹. Estimates of total groundwater loss from historical and
more recent water level records (around 160 km³; refs 3 and 4)
correspond to a reduction in the effective normal stress of 2.7-9.5 kPa for the
San Andreas Fault, and a Coulomb stress change of 10-15 kPa at Coalinga,
since the beginning of groundwater extraction in 1860. These results
suggest that human activity may give rise to a gradual increase in the
rate of earthquake occurrence, as suggested by earthquake catalogues
in central California.

Figure 3 | Seasonal peak uplift from GPS. Phase and peak-to-peak amplitude
of annual vertical GPS displacement for all stations included in our analysis.
The inset shows histograms of peak uplift phase (binned by month) for
stations in the San Joaquin Valley, Sierra Nevada, and Coast Range. Peaks in
seismicity rate for the locked and creeping San Andreas Fault at Parkfield
are defined as months with higher than average declustered seismicity.
Smaller-amplitude peak uplift in the Sierra Nevada and Coast Range during
the dry summer and autumn is largely out of phase with larger-amplitude
uplift in the San Joaquin Valley driven by the poro-elastic response to recharge
during the wetter winter and spring. Stations peripheral to the valley show
uplift patterns in accordance with nearby sites on bedrock, indicating the
dominance of surface loads in driving vertical motions away from areas
affected by local irrigation and groundwater fluctuations.

Future scenarios for groundwater in California suggest increasing
demand for agricultural, urban and environmental use. Climate change
will probably exacerbate the stress on this resource through altered
precipitation patterns, more frequent droughts, earlier snowmelt,
larger floods, and increasing temperatures and evapotranspiration. We
demonstrate how long-term and seasonal hydrospheric mass changes
due to groundwater removal significantly affect regional crustal
deformation, which can lead to annual variations in small-earthquake
frequency and longer-term changes of stresses on the San Andreas Fault.
Our model explains recent vertical uplift rates previously attributed to
epeirogenic deformation in central California, indicating that much
of this signal results from anthropogenic groundwater removal. Taken
together, our results highlight new and underappreciated links between
human activity and solid earth processes driven by hydrospheric change.


METHODS SUMMARY

Elastic model. A two-dimensional model accounts for deformation from a normal
line load distributed over a finite-width strip on the surface of the half-space.
The vertical displacement rate $\dot{u}$ at the surface (z = 0) along a profile perpendicular
to the load centred at $x = x_d$ is given by:

\[
\dot{u}(x, z = 0) = \frac{(1 - \nu) N_0}{\pi G (2a)} \left[ 2a + (x - x_d - a)\ln|x - x_d - a| - (x - x_d + a)\ln|x - x_d + a| \right] + K
\]

where x and z are the horizontal and vertical dimensions, respectively, $N_0$ is the rate
of line load, ν is Poisson’s ratio, G is the shear modulus, a is the strip half-width and
K is a far-field rate offset correction. The load is centred on the middle of the valley
and covers its entire width (2a = 60 km). Given a range of elastic parameters (ν =
0.25, G = 35 ± 5 GPa), we fit observed vertical velocities using a line-load rate with
$N_0$ = 8.8 (±1.3) × 10⁷ N m⁻¹ yr⁻¹ (Fig. 2).
Unloading from groundwater loss. The Sacramento and San Joaquin basins lost
an estimated volume of 30.9 (±2.6) km³ in total water storage between October
2003 and March 2010 (ref. 3). Total water storage change in the San Joaquin Valley
(21.5 ± 6.2 km³) equates to an unloading of 3.3 (±1.0) × 10¹³ N yr⁻¹ over a valley
length of 450 km, giving an equivalent line load of $N_0$ = 7.2 (±2.1) × 10⁷ N m⁻¹ yr⁻¹,
in agreement with the elastic model.
Unclamping of the San Andreas Fault. The unclamping rate (change in fault
normal stress) on a vertical fault is calculated as:

\[
\dot{\tau}_{xx} = -\frac{N_0}{2\pi a} \left[ (\theta_1 - \theta_2) + \sin(\theta_1 - \theta_2)\cos(\theta_1 + \theta_2) \right]
\]

where $\theta_1$ and $\theta_2$ are the angles from the load edges to any point at depth, measured
clockwise from the positive x direction. For the San Andreas Fault, located about
70 km from the centre of the 60-km-wide load, we estimate an unclamping rate of
0.07-0.18 kPa yr⁻¹ at seismogenic depths (5-15 km) from groundwater unloading.


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS


GPS data processing. We use data from continuously recording GPS stations between
−122° and −114° longitude and 34° to 38° latitude that had time series longer
than 2.5 years that were >50% complete (Extended Data Fig. 1). This data set in- 
cludes 566 continuous GPS stations within this geographic region. No episodic 
campaign or semi-continuous stations were included. Percentage completeness is 
defined as the number of daily solutions for the station divided by the number of 
potential daily solutions given the duration of the time series. Larger values for per- 
centage completeness are preferred in order to best resolve seasonal oscillations of 
position. 

The data were processed as a part of a >11,000 station mega-network analysis
system that retrieves data daily and updates solutions weekly. We use the GIPSY/
OASIS software provided by the Jet Propulsion Laboratory to estimate station
coordinates every 24 h using the Precise Point Positioning method. Ionosphere-free
combinations of carrier phase and pseudorange were obtained every 5 min. Calibrations
were applied for all antennas, ground receivers and satellite transmitters. To model
tropospheric refractivity, the Global Mapping Function was applied, with
tropospheric wet zenith delay and horizontal gradients estimated as stochastic
random-walk parameters every 5 min (ref. 33). The observable model includes ocean tidal
loading (including companion tides) coefficients supplied by Chalmers University.
Ambiguity resolution was applied to double differences of the estimated one-way
bias parameters, using the wide lane and phase bias method, which phase-connects
individual stations to IGS stations in common view. Satellite orbit and clock
parameters were provided by the Jet Propulsion Laboratory, who determine these
parameters in a global fiducial-free analysis using a subset of the available IGS core
stations as tracking sites. Output station coordinates are initially in the loose frame
of the Jet Propulsion Laboratory’s fiducial-free GPS orbits. These were transformed
into reference frame IGS08 using daily seven-parameter transformations supplied
by the Jet Propulsion Laboratory. IGS08 is derived from ITRF2008 (ref. 37) and
consists of 232 globally distributed IGS stations. Finally, the solutions are aligned
with our custom reference frame (NA12) that co-rotates with stable North America.
This alignment is applied to each daily solution and provides a spatial filter to
suppress errors correlated at the continental scale. The mean formal uncertainty in
daily vertical coordinate is <3.0 mm. A summary reference table for GPS solution
methodology is given at http://geodesy.unr.edu/gps/ngl.acn. The resulting time
series are used to estimate rates of motion with respect to stable North America.
GPS time series analysis. An outlier exclusion criterion was applied to remove daily
solutions that were more than 20 mm from the expected position based on a
provisional time series model, or having position uncertainty greater than 20 mm. For
each of the three components (east, north, up) of each GPS time series we solved for
the intercept b, rate v, and steps of size $D_i$ at the times $t_{\mathrm{eqp},i}$ of N known
equipment changes that can introduce discontinuities. We solve for annual and
semiannual seasonal terms by solving for the cosine ($C_k$) and sine ($S_k$) terms.
For example, the east component is:

\[
e(t) = b + vt + \sum_{i=1}^{N} D_i H(t - t_{\mathrm{eqp},i}) + \sum_{k=1}^{2} \left[ C_k \cos(2\pi k t) + S_k \sin(2\pi k t) \right]
\]

where H(t) is the Heaviside step function and t is in years. We use a damped
linear inversion with weights on the prior uncertainties of the model parameters
scaled to allow the expected range of values for each parameter. However, because
only continuously recording GPS stations were used, all parameters in the
estimation (including seasonal terms) were well constrained, so the damping parameters
had little influence on the solutions. The amplitude of the annual terms was
calculated with:

\[
A = \left( C_1^2 + S_1^2 \right)^{1/2}
\]

and the day of year from:

\[
\mathrm{day} = \left( \frac{T}{2\pi} \right) \tan^{-1}\left( \frac{S_1}{C_1} \right)
\]

where T is the length of the year in days. Finally, stations were excluded if their
rate uncertainty was greater than 1 mm yr⁻¹ or their rate was clearly anomalous
(that is, greater than 100 mm yr⁻¹). In all, we
estimated solutions for 566 stations, 96% of which have time series duration greater
than 4.0 years, and 92% of which are 80% or more complete (Extended Data Fig. 2).
Example vertical GPS time series are shown in Extended Data Fig. 3. GPS time series
parameters, including vertical uplift rate and the amplitude and timing of peak
uplift are included as source data for Fig. 1.
Elastic half-space model. We use an elastic half-space model for calculating the
elastic response of the crust to changes in total water storage. The model is
two-dimensional and accounts for surface and subsurface deformation induced by a line
load over the surface of the half-space along a finite-width strip (half-width a,
Extended Data Fig. 4a). Model derivation can be found in ref. 30, with a small
correction from ref. 13. The normal displacement of the surface (z = 0) along a
profile perpendicular to the load centred at $x = x_d$ is given by

\[
u(x, z = 0) = \frac{(1 - \nu) N_0}{\pi G (2a)} \left[ 2a + (x - x_d - a)\ln|x - x_d - a| - (x - x_d + a)\ln|x - x_d + a| \right] + K
\]

where x and z are the horizontal and vertical dimensions, respectively, $N_0$ is a line
load, ν is Poisson’s ratio, G is the shear modulus, a is the strip half-width and K is a
far-field rate offset correction. We fix the position of the line load to the centre of
the San Joaquin Valley, and use a load width of 2a = 60 km corresponding to the
average width of the valley floor. Given a range of elastic parameters (ν = 0.25,
G = 35 ± 5 GPa), we can fit the vertical velocities derived from the GPS data by a
line-load rate with $N_0$ = 8.8 (±1.3) × 10⁷ N m⁻¹ yr⁻¹ (see Fig. 2). Goodness of fit
is estimated using a reduced chi-square criterion. Our best-fit model has a reduced
chi-square of 1.9, indicating a reasonably good fit.
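
As a rough illustration of the strip-load expression above, the following sketch evaluates the surface displacement rate for the quoted parameters (ν = 0.25, G = 35 GPa, 2a = 60 km, N₀ = 8.8 × 10⁷ N m⁻¹ yr⁻¹). It is not the authors' fitting code. The function names are hypothetical, the unloading rate is entered as a positive number so that uplift comes out positive, and because the offset K is not given in the text only differences between positions are meaningful.

```python
import math

def xlog_abs(t):
    """Return t * ln|t|, using the limiting value 0 at t = 0."""
    return 0.0 if t == 0.0 else t * math.log(abs(t))

def strip_uplift_rate(x, load_rate, half_width, x_centre=0.0,
                      poisson=0.25, shear_modulus=35e9, offset=0.0):
    """Surface displacement rate (m/yr) at horizontal position x (m).

    load_rate is N0 (N m^-1 yr^-1), half_width is a (m), offset is K (m/yr).
    """
    d = x - x_centre
    bracket = (2.0 * half_width
               + xlog_abs(d - half_width)
               - xlog_abs(d + half_width))
    prefactor = ((1.0 - poisson) * load_rate
                 / (math.pi * shear_modulus * 2.0 * half_width))
    return prefactor * bracket + offset

N0 = 8.8e7          # unloading rate from the text, N m^-1 yr^-1
HALF_WIDTH = 30e3   # a = 30 km

centre = strip_uplift_rate(0.0, N0, HALF_WIDTH)
margin = strip_uplift_rate(30e3, N0, HALF_WIDTH)
far = strip_uplift_rate(100e3, N0, HALF_WIDTH)

# Differences are independent of the unknown offset K.
print(f"centre relative to 100 km: {(centre - far) * 1e3:.2f} mm/yr")
print(f"margin relative to 100 km: {(margin - far) * 1e3:.2f} mm/yr")
```

Under these assumptions the valley centre rises roughly 1.3 mm yr⁻¹ faster than a point 100 km from the load centre, the same order as the 1-3 mm yr⁻¹ uplift reported from GPS.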

Unloading from estimated mass loss. The combined Sacramento-San Joaquin
River basins lost an estimated volume of 30.9 (±2.6) km³ in total water storage over
a 6.5-year period between October 2003 and March 2010 (ref. 3). Of this volume,
20.3 (±3.8) km³ is attributed to loss of groundwater in the Central Valley, 80% of
which occurred in the San Joaquin Valley. Assuming that changes in total water
storage not related to groundwater (for example, soil moisture, surface water and
snow) are distributed equally between the Sacramento and San Joaquin basins, the
total water storage change in the San Joaquin Valley over the 6.5-year period is
21.5 (±6.2) km³. This change is equivalent to a rate of mass loss of 3.3 (±1.0)
gigatons per year, and therefore a rate of unloading of 3.2 (±1.0) × 10¹³ N yr⁻¹
distributed over the 450 km length of the valley. Taking into account the range of
uncertainties in groundwater and total water storage change, the equivalent
line-load rate is thus $N_0$ = 7.2 (±2.1) × 10⁷ N m⁻¹ yr⁻¹, in agreement with the estimate
from the elastic model based on the vertical GPS velocities. If we add the estimated
1860-2003 depletion of 140 km³ (ref. 4) to the 2003-2010 water loss, the cumulative
historic load decrease amounts to ~3.5 × 10⁹ N m⁻¹.
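
The conversion from water volume to line load in the paragraph above can be checked with a few lines of arithmetic. This is a sketch only: the water density (1,000 kg m⁻³) and gravitational acceleration (9.81 m s⁻²) are assumed values not stated in the text, and the function name is hypothetical.

```python
WATER_DENSITY = 1000.0    # kg m^-3 (assumed, not stated in the text)
GRAVITY = 9.81            # m s^-2 (assumed, not stated in the text)
VALLEY_LENGTH_M = 450e3   # valley length from the text

def line_load(volume_km3, length_m=VALLEY_LENGTH_M):
    """Weight of a water volume per unit length of valley, in N m^-1."""
    volume_m3 = volume_km3 * 1e9
    return volume_m3 * WATER_DENSITY * GRAVITY / length_m

# 21.5 km^3 lost over 6.5 years in the San Joaquin Valley
rate = line_load(21.5) / 6.5
# 140 km^3 (1860-2003) plus 21.5 km^3 (2003-2010)
cumulative = line_load(140.0 + 21.5)

print(f"line-load rate: {rate:.2e} N m^-1 yr^-1")
print(f"cumulative load decrease: {cumulative:.2e} N m^-1")
```

With these assumed constants the rate comes out close to 7.2 × 10⁷ N m⁻¹ yr⁻¹ and the cumulative decrease close to 3.5 × 10⁹ N m⁻¹, matching the values quoted above.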

Stress modelling. The two-dimensional stress components at any point at depth
caused by a distributed line load $N_0$ (in N m⁻¹; the negative value indicates
unloading) at the surface of the elastic half-space are calculated as:

\[
\tau_{xx} = -\frac{N_0}{2\pi a} \left[ (\theta_1 - \theta_2) + \sin(\theta_1 - \theta_2)\cos(\theta_1 + \theta_2) \right]
\]

\[
\tau_{zz} = -\frac{N_0}{2\pi a} \left[ (\theta_1 - \theta_2) - \sin(\theta_1 - \theta_2)\cos(\theta_1 + \theta_2) \right]
\]

\[
\tau_{xz} = -\frac{N_0}{2\pi a} \left[ \sin(\theta_1 - \theta_2)\sin(\theta_1 + \theta_2) \right]
\]

where $\theta_1$ and $\theta_2$ are the angles from both edges of the load measured clockwise
from the positive x direction, a is the load half-width and z is positive downward
(Extended Data Fig. 4a). The shear and normal stress components resolved on a
fault plane dipping at an angle $\varphi$ with a strike direction normal to the xz plane are
given by:

\[
\tau_n = \tau_{zz}\cos^2(\varphi) - 2\tau_{xz}\sin(\varphi)\cos(\varphi) + \tau_{xx}\sin^2(\varphi)
\]

\[
\tau_s = (\tau_{zz} - \tau_{xx})\sin(\varphi)\cos(\varphi) + \tau_{xz}\left( \cos^2(\varphi) - \sin^2(\varphi) \right)
\]

Using Coulomb failure assumptions, the Coulomb failure stress is expressed as:

\[
\sigma_c = |\tau_s| + \mu(\tau_n - p) + S
\]

where μ is the coefficient of friction, p is pore-fluid pressure and S is cohesion.
Assuming that p, S and μ are constant over time, the change in Coulomb stress is
given by:

\[
\Delta\sigma_c = \Delta|\tau_s| + \mu\Delta\tau_n
\]

where $\Delta\tau_n$ is positive for tension. Similarly, the Coulomb stressing rate is given
by:

\[
\dot{\sigma}_c = |\dot{\tau}_s| + \mu\dot{\tau}_n
\]

The first term on the right-hand side is valid for an isotropic failure plane. We
consider the change in shear stress in the slip direction (or rake) and vary the sign
of the change in shear stress accordingly. For the San Andreas Fault we consider a
dip angle of 90° and the unclamping stress (or stress rate) is given simply by $\Delta\tau_{xx}$
(or $\dot{\tau}_{xx}$). For the Coalinga blind thrust we use dip angles ranging between 15° and
30° (refs 40 and 41) and a negative change in shear stress. We also consider a 65°
northeast-dipping fault with a positive slip direction, simulating stress conditions
on the Nuñez thrust fault. For each fault we calculate the normal and shear stress
components at depths of 5 km and 15 km, representative of the seismogenic depth
in Central California.
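
The resolution of stress components onto a dipping fault plane, and the resulting Coulomb stress change, can be sketched as follows. The stress values are arbitrary illustrative numbers rather than results from the paper, and the `slip_sign` parameter is a hypothetical stand-in for the sign choice made according to slip direction.

```python
import math

def resolve_on_fault(tau_xx, tau_zz, tau_xz, dip_deg):
    """Return (normal, shear) stress on a plane dipping dip_deg from horizontal."""
    phi = math.radians(dip_deg)
    s, c = math.sin(phi), math.cos(phi)
    tau_n = tau_zz * c * c - 2.0 * tau_xz * s * c + tau_xx * s * s
    tau_s = (tau_zz - tau_xx) * s * c + tau_xz * (c * c - s * s)
    return tau_n, tau_s

def coulomb_change(d_tau_n, d_tau_s, friction=0.7, slip_sign=1.0):
    """Coulomb stress change with tension positive.

    slip_sign (+1 or -1) is a hypothetical switch for the assumed slip direction.
    """
    return slip_sign * d_tau_s + friction * d_tau_n

# Arbitrary illustrative stress changes in kPa (not values from the paper).
tau_xx, tau_zz, tau_xz = 1.0, 0.2, -0.3

n_vertical, s_vertical = resolve_on_fault(tau_xx, tau_zz, tau_xz, 90.0)
n_thrust, s_thrust = resolve_on_fault(tau_xx, tau_zz, tau_xz, 30.0)

print("vertical fault normal stress:", n_vertical)
print("30-degree thrust Coulomb change:",
      coulomb_change(n_thrust, s_thrust, slip_sign=-1.0))
```

For a 90° dip the normal stress reduces to the horizontal component, which is why the unclamping of a vertical fault such as the San Andreas is given by the xx component alone.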

Rate of unclamping of the San Andreas Fault. The rate of unclamping on a
vertical fault (change in fault normal stress) can be calculated as:

\[
\dot{\tau}_{xx} = -\frac{N_0}{2\pi a} \left[ (\theta_1 - \theta_2) + \sin(\theta_1 - \theta_2)\cos(\theta_1 + \theta_2) \right]
\]

where $\theta_1$ and $\theta_2$ are the angles from both edges of the loading strip to any point at
depth, measured clockwise from the positive x direction. Taking an estimate of the
unloading rate due to loss in water storage of 8.8 (±1.3) × 10⁷ N m⁻¹ yr⁻¹, as well
as average spatial parameters for the San Andreas Fault (load width of 2a = 60 km;
distance to centre of the distributed line load of around 70 km; seismogenic depths
of 5-15 km), we estimate a rate of unclamping of 0.07-0.18 kPa yr⁻¹ at the
seismogenic depth along the San Andreas Fault due to the elastic rebound caused by
unloading (Extended Data Fig. 4b and c).
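
The unclamping estimate can be reproduced approximately from the quoted geometry. This sketch assumes that the two angles are measured from the positive x axis towards increasing depth, to the load edges at x − a and x + a, and that N₀ is negative for unloading with tension positive. These conventions and the function name are assumptions for illustration, not a statement of the authors' implementation.

```python
import math

def unclamping_rate(x, z, load_rate, half_width):
    """Horizontal normal stress rate (Pa/yr) at distance x (m) from the load
    centre and depth z (m), for a strip load of half-width half_width (m)."""
    theta1 = math.atan2(z, x - half_width)  # angle to the near load edge
    theta2 = math.atan2(z, x + half_width)  # angle to the far load edge
    bracket = ((theta1 - theta2)
               + math.sin(theta1 - theta2) * math.cos(theta1 + theta2))
    return -load_rate / (2.0 * math.pi * half_width) * bracket

N0 = -8.8e7         # N m^-1 yr^-1, negative for unloading (assumed convention)
HALF_WIDTH = 30e3   # a = 30 km
FAULT_X = 70e3      # San Andreas Fault about 70 km from the load centre

shallow = unclamping_rate(FAULT_X, 5e3, N0, HALF_WIDTH)
deep = unclamping_rate(FAULT_X, 15e3, N0, HALF_WIDTH)

print(f"5 km depth:  {shallow / 1e3:.3f} kPa/yr")
print(f"15 km depth: {deep / 1e3:.3f} kPa/yr")
```

Under these conventions the sketch gives roughly 0.07 kPa yr⁻¹ at 5 km and 0.18 kPa yr⁻¹ at 15 km, consistent with the range quoted above.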

For the Coalinga blind thrust, we calculate the normal ($\dot{\tau}_n$) and shear ($\dot{\tau}_s$) stress
rates resolved on a range of fault planes and depths. The Coulomb failure stress rate
(ignoring pore pressure), $\dot{\sigma}_c = |\dot{\tau}_s| + \mu\dot{\tau}_n$ (ref. 30), is then calculated with μ = 0.7.
Seasonal vertical displacements and stress changes. The elastic model can be used
to estimate total vertical displacements caused by variations in total water storage.
The model considered is more broadly distributed over the Coast Range, the San
Joaquin Valley and the western Sierra Nevada (width of 200 km; Fig. 2),
corresponding to the approximate total width of the San Joaquin River drainage basin.
Taking an average of 200 mm of change in total water storage, we estimate a load
of ~4 × 10⁸ N m⁻¹, which produces annual vertical displacements of 3-8 mm within
the load extent (Fig. 2). The associated stress variations on the San Andreas Fault are
about 1 kPa and the Coulomb stress changes on the Coalinga thrust are 1.0-1.7 kPa
(Extended Data Fig. 4d and e). Varying the width of the distributed line loads by
±20 km and the load centre by ±10 km has only a small impact (<0.5 kPa) on the
resolved seasonal stress changes on the faults.
Extended Data Figure 1 | Vertical GPS data. Map of vertical uplift rates spanning California and the western Great Basin, including stations within the Central
Valley groundwater basin. EL, elevation; GPS vert., GPS vertical velocity. 
Extended Data Figure 2 | GPS time series. The top panel shows a histogram of
the duration of GPS time series. 96% are longer than 4.0 years. The bottom panel 
shows a histogram of the percentage of complete time series. 92% are greater 
than 80% complete. 
Extended Data Figure 3 | GPS time series. Example vertical GPS time
series for stations P056 and P570. Blue dots are daily vertical position solutions,
red lines are the model estimated to represent the time series. The station
P056 is in the southern Central Valley near Porterville, California. The station
P570 is near Weldon, California, east of Lake Isabella.
Extended Data Figure 4 | Stress profiles from distributed line loads of an
elastic half-space. a, Model setup, where $N_0$ is the line-load rate, a is the load
half-width, and $\theta_1$ and $\theta_2$ measure the angle downward from the positive x
axis to any point ($x_0$, $z_0$) at depth. $\dot{\tau}$ and $\dot{\sigma}_c$ give the stress rates and Coulomb
stress rates, respectively. b, c, Stress rates at depths of 5 km (b) and 15 km (c)
calculated from the long-term rate of unloading over the San Joaquin Valley
with a = 30 km. Vertical dashed lines labelled SAF and CF represent the
locations of the San Andreas Fault and the Coalinga blind thrust faults
(including the Nuñez fault), respectively, relative to the load centre. Black
and coloured lines indicate stress components and Coulomb stress changes,
respectively. The blue curves labelled N show Coulomb stress calculations for
favourably oriented faults with a 65° dip, representing the Nuñez fault. For
the red curve representing the Coalinga fault (CF), the calculations are
performed using a dip of 30° for unfavourably oriented faults. The green
curve represents unclamping stress for vertically dipping faults such as the San
Andreas Fault. d, e, Stress changes at depths of 5 km and 15 km from seasonal
(peak-to-peak) load changes over the San Joaquin River Basin with a = 100 km
(full width of 200 km). The position of the San Andreas Fault and Coalinga
faults reflects displacement of the load centre by 30 km relative to the long-term
load. Varying the load half-width and the load centre by ±10 km has only
a small impact (<0.5 kPa) on the resolved seasonal stress changes on the
faults.
High-throughput screening of a CRISPR/Cas9
library for functional genomics in human cells 


Yuexin Zhou, Shiyou Zhu, Changzu Cai, Pengfei Yuan, Chunmei Li, Yanyi Huang & Wensheng Wei


Targeted genome editing technologies are powerful tools for studying
biology and disease, and have a broad range of research applications.
In contrast to the rapid development of toolkits to manipulate indi- 
vidual genes, large-scale screening methods based on the complete 
loss of gene expression are only now beginning to be developed*”. 
Here we report the development of a focused CRISPR/Cas-based 
(clustered regularly interspaced short palindromic repeats/CRISPR- 
associated) lentiviral library in human cells and a method of gene 
identification based on functional screening and high-throughput 
sequencing analysis. Using knockout library screens, we successfully 
identified the host genes essential for the intoxication of cells by 
anthrax and diphtheria toxins, which were confirmed by functional 
validation. The broad application of this powerful genetic screen- 
ing strategy will not only facilitate the rapid identification of genes 
important for bacterial toxicity but will also enable the discovery of 
genes that participate in other biological processes. 

Recent progress in genomic editing, such as ZFN (zinc finger nuclease)*~, 
TALENs (transcription activator-like effector nucleases)*° and the 
CRISPR/Cas system®”’, has markedly changed the way that researchers 
study genes and their functions in mammalian systems. The CRISPR/ 
Cas system was originally found to provide bacteria and archaea with 
adaptive immunity against viruses and plasmids'’®. By means of the 
type II CRISPR system, the Cas9 nuclease can be directed by a chimae- 
ric single guide RNA (sgRNA) via Watson-Crick base pairing” to the 
desired genomic loci followed by an NGG protospacer-adjacent motif 
(PAM) to create double-stranded DNA breaks (DSBs)°”. Taking advan- 
tage of this effective gene-editing technology with easy programmabi- 
lity, we developed a focused sgRNA library that enables gene identification 
from function-based genetic screening. 
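
As a rough illustration of the targeting rule described above, the following Python sketch scans the forward strand of a DNA string for candidate protospacers that lie immediately 5' of an NGG PAM. The function name, the default spacer length of 20 nt and the toy sequence are illustrative assumptions; this is not the authors' design algorithm, and the sketch ignores the reverse strand and any off-target scoring.

```python
import re


def find_ngg_targets(sequence, spacer_len=20):
    """Return (start, spacer, pam) for each spacer directly followed by NGG.

    Only the forward strand is scanned; spacer_len is an assumed default.
    """
    sequence = sequence.upper()
    # A lookahead is used so that overlapping candidate sites are all reported.
    pattern = re.compile(r"(?=([ACGT]{%d})([ACGT]GG))" % spacer_len)
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(sequence)]


# Hypothetical toy sequence: a 20-nt spacer, an AGG PAM, then filler.
toy = "ACGTACGTACGTACGTACGT" + "AGG" + "TTT"
for start, spacer, pam in find_ngg_targets(toy):
    print(start, spacer, pam)
```

A real design pipeline would also scan the reverse complement and filter candidates for similarity to other genomic sites.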

The CRISPR/Cas system requires both the gene-specific sgRNA and 
an identical nuclease Cas9 to achieve DSBs at the targeted locus®”’. 
We developed cell lines with a constant expression of Cas9 with OCT1 
(Fig. 1a and Supplementary Fig. 1), as it has been reported that overexpression 
of OCT1 would further boost the sgRNA expression through 
enhanced U6 promoter activity’*. We used the Golden Gate cloning 
method” to construct the sgRNA on the lentiviral backbone (Fig. 1b 
and Supplementary Fig. 1). To verify whether the expression of sgRNA 
through lentiviral delivery is sufficient to guide Cas9 to the targeted loci 
to generate DSBs, we arbitrarily selected the CSPG4 gene for targeting 
and conducted the T7E1 digestion assay as previously described’*. CSPG4- 
specific sgRNA was found to induce insertions and deletions (indels) 
through viral infection in all three lines constantly expressing Cas9 and 
OCT1 (that is, HEK293T_OC, HT1080_OC and HeLa_OC, where OC indicates 
OCT1 and Cas9). However, the pooled HeLa_OC cells showed 
lower efficiency in creating indels than the pooled HEK293T_OC and 
HT1080_OC cells at day 5 after infection, whereas one isolated single 
clone of HeLa_OC, designated HeLa_OC-SC, showed the highest level 
of efficiency (Fig. 1c). This difference in indel efficiency is unlikely to 
be due to variable levels of Cas9 expression (Extended Data Fig. 1), 
and prolonged culturing enabled HeLa_OC at day 8 to reach a cleavage 
efficiency similar to that of HEK293T_OC and HT1080_OC at day 5 (Fig. 1c), 
although still significantly lower than that of HeLa_OC-SC. It is therefore critical 
to isolate a single clone with high DSB mis-rejoining activity, at 
least for certain cell types. The beneficial role of exogenous expression 
of OCT1 requires further investigation. 

We next examined the off-target activity of virally delivered sgRNA 
in HeLa_OC-SC. Three representative sgRNAs targeting three genes 
(ANTXR1, HBEGF and CSPG4) were chosen for this analysis. The 
top five off-target sites were selected for each sgRNA, and the T7E1 
assays indicated that there were no off-target cleavages except for one 
case, in which the off-target site of the sgRNA targeting HBEGF has exactly 
the same sequence for the last 15 base pairs immediately upstream of 
the PAM (Supplementary Table 1). All of the off-target sites tested were 
located in non-coding regions, and are therefore unlikely to alter gene 
function. 

Figure 1 | Lentiviral-delivered sgRNA creates indels with high efficiency in 
human cells stably expressing Cas9 and OCT1. a, Structure of the lentiviral 
plasmid expressing 2A-linked OCT1 and Cas9. An IRES-linked blasticidin- 
resistance gene was fused with the cas9 gene to facilitate the selection of cell 
clones with high-level expression of Cas9 and OCT1. The pLenti-CMV-BSD 
vector was used for plasmid construction. nt, nucleotides; LTR, long terminal 
repeat. b, Schematic diagram of the sgRNA expression construct in a lentiviral 
backbone. U6 promoter-driven ccdB and sgRNA scaffold were cloned into the 
pLL3.7 lentiviral vector. Annealed oligonucleotide pairs encoding customized 
sgRNA were ligated into the lentiviral backbone using the Golden Gate method. 
c, Indels induced by lentivirus-delivered sgRNA (5'-TTGGCCAGACTTGCATCCG-3') 
targeting the CSPG4 gene (CSPG4-specific sgRNA3) in the 
indicated cells were assayed by T7E1 digestion. All cells were incubated for 5 or 
8 days after infection before assay as indicated. Primers used for the PCR 
reactions are listed in Supplementary Table 5, and the percentage of cleaved 
band was measured using the ImageJ program (http://rsbweb.nih.gov/ij/) for 
this and other figures. 

To establish our methodology, we created a library targeting 291 
human genes that fit our specific research interests. Assisted by a rule’- 
based algorithm, we designed three kinds of sgRNAs targeting each 
gene for a majority of the 291 genes (Supplementary Table 2). The 
single clone HeLa_OC-SC was chosen to produce the library because this 
particular line generated the highest level of indels (Fig. 1c). The lib- 
rary, designated as HeLa/sgRNA'”"™”, was created in such a way that 
the coverage of sgRNAs is approximately 1,000-fold with a virus MOI 
(multiplicity of infection) of 0.05. We then applied this library to a 
functional screening process using two bacterial toxins: diphtheria and 
chimaeric anthrax (PA/LFnDTA, protective antigen (PA)/N-terminal 
domain of lethal factor (LF) fused to the catalytic subunit of diphtheria 
toxin) toxins. Figure 2 displays a schematic of the library construction 
and functional screening. After three rounds of toxin treatment, the 
surviving cells were collected, most of which exhibited green fluores- 
cence (Extended Data Fig. 2), indicating the active presence of sgRNA- 
containing cartridges that carry CMV promoter-driven EGFP (Fig. 1b). 
The genomic DNA of the pooled surviving cells from the library, as 
well as the original library of cells before the toxin treatment, was 
extracted and used for PCR amplification of the sgRNA-coding regions, 
before being subjected to deep-sequencing analysis (Fig. 2). These 
sgRNA-coding regions can be amplified rather specifically without 
interference from the host chromosomal DNA (Extended Data Fig. 3). 

Figure 2 | Schematic of sgRNA library construction and functional 
screening. sgRNAs were delivered into HeLa_OC-SC cells by lentiviral infection 
with an MOI of 0.05. Library cells stably expressing sgRNAs were obtained by 
FACS for green fluorescence 2 days after infection. Library screening was 
conducted by three rounds of toxin treatment, followed by PCR amplification 
of the sgRNA-coding sequence integrated into the chromosomes. Primers used 
for the PCR reaction are listed in Supplementary Table 5. Purified PCR product 
was subjected to high-throughput sequencing analysis with an Illumina 
HiSeq 2500. 

High-throughput sequencing analysis revealed a total of 863 (99.3%) 
sgRNA sequences from the control library that was designed to contain 
869 kinds of sgRNA targeting a total of 291 genes (Supplementary 
Table 2). The read counts for most of the sgRNA coding sequences were 
consistent among three repeats (Supplementary Table 3 and Extended 
Data Fig. 4a) and after prolonged culturing (Extended Data Fig. 4b). 
After three rounds of toxin treatment, however, the composition and 
the read count of the sgRNAs showed distinct differences between the 
two different screens. We organized all the log2-fold changes of the 
normalized counts into two heat maps, ranked according to their 
enrichment level from either PA/LFnDTA or diphtheria toxin selection. 
Samples of the top hits in each screen are also shown with 
their gene names and sgRNA numbers. Most of the sgRNAs were 
depleted in both cases, as expected (Fig. 3a). Using the analytical tool 
DESeq2 (ref. 15), we selected 21 sgRNAs targeting 19 genes from PA/ 
LFnDTA screening and 15 sgRNAs targeting 15 genes from diphtheria 
toxin screening as potential candidates for future studies (Figs 3b, c 
and Supplementary Table 4). Notably, all three sgRNAs targeting the 
same known anthrax toxin receptor gene ANTXR1 (also called TEM8) 
(ref. 16) were enriched and ranked at the top from the screening of PA/ 
LFnDTA (Fig. 3b), and one of the two sgRNAs targeting the diphtheria 
toxin receptor gene HBEGF (ref. 17) ranked at the top from the screen- 
ing of the diphtheria toxin (Fig. 3c). 
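
The ranking described above can be sketched in a few lines of Python. This is a simplified stand-in, not the DESeq2 procedure the authors used: it scales each library to reads per million, adds a pseudocount (an assumption, to avoid division by zero for depleted sgRNAs) and ranks sgRNAs by log2 fold change. The function names and the read counts are hypothetical.

```python
import math


def log2_fold_changes(control, treated, pseudocount=1.0):
    """Return {sgRNA: log2(treated/control)} using reads-per-million scaling."""
    control_total = sum(control.values())
    treated_total = sum(treated.values())
    result = {}
    for guide, control_reads in control.items():
        control_rpm = control_reads / control_total * 1e6
        # sgRNAs absent after selection count as zero reads.
        treated_rpm = treated.get(guide, 0) / treated_total * 1e6
        result[guide] = math.log2((treated_rpm + pseudocount) / (control_rpm + pseudocount))
    return result


def rank_by_enrichment(fold_changes):
    """Order sgRNA names from most enriched to most depleted."""
    return sorted(fold_changes, key=fold_changes.get, reverse=True)


# Hypothetical counts: one sgRNA survives selection, the other is depleted.
control_counts = {"guide_A": 100, "guide_B": 100}
treated_counts = {"guide_A": 300, "guide_B": 100}
lfc = log2_fold_changes(control_counts, treated_counts)
print(rank_by_enrichment(lfc))
```

Because each library is rescaled to the same depth, an sgRNA whose raw count is unchanged can still show a negative fold change when other sgRNAs are strongly enriched.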

Notably, the three ANTXR1-targeting sgRNAs varied in their efficacy 
of inducing gene interruption, as shown by the rates of indels 
determined by a T7E1 digestion assay. The two most strongly enriched 
sgRNAs (sgRNA1 and sgRNA3) showed a much higher efficiency in 
causing indels at the targeted locus of the ANTXR1 gene (Fig. 3d). Consistent 
with this, the two sgRNAs targeting HBEGF also differed in their efficiency 
of inducing indels (Fig. 3e): HBEGF sgRNA3, which had the much 
higher efficiency, was enriched and ranked at the top of the 
diphtheria toxin screening results (Fig. 3a), whereas the other sgRNA 
targeting HBEGF ranked at number 35 (Supplementary Table 4). 

To validate the functions of candidate genes identified from the 
library screening, we generated individual gene knockouts in HeLa 
cells for ANTXR1 and HBEGF. The ANTXR1 knockout in HeLa cells 
was created by either the TALEN technique (ref. 18) or the CRISPR/Cas9 system 
(Extended Data Fig. 5a, b). We tested all three mutant clones (one created 
by TALENs and the other two created by CRISPR/Cas9) for their 
susceptibility to the two toxins using both XTT (2,3-bis-(2-methoxy- 
4-nitro-5-sulfophenyl)-2H-tetrazolium-5-carboxanilide) cell viability 
and LDH (lactate dehydrogenase) cell death assays. The HeLa ANTXR1−/− cells 
became totally resistant to PA-mediated toxicity, while remaining susceptible 
to the diphtheria toxin (Fig. 4a and Extended Data Fig. 5c, d). 
Similarly, HeLa HBEGF−/− cells were completely resistant to diphtheria toxin, 
but not to PA/LFnDTA, for two independent clones created through 
the CRISPR/Cas9 system (Fig. 4b and Extended Data Fig. 6). The identification 
and confirmation of these two known toxin receptors demonstrates 
the high level of efficiency and reliability of the CRISPR/Cas9 
library in functional screening. 

To determine whether our screening revealed novel genes involved 
in the two toxins’ intoxication mechanisms, we picked candidate genes 
from the top of sgRNA enrichment lists (Fig. 3b, c) for further valid- 
ation. Because only one sgRNA was positively selected for most of the 
candidates, we designed additional sgRNAs (Supplementary Table 6) 
and examined their capacity to produce a toxin-resistance phenotype. 
Four genes (PLXNA1, FZD10, PECR and CD81) were confirmed to be 
potential functional candidates specifically involved in PA-mediated 
anthrax toxicity because at least two different sgRNAs for each can- 
didate produced a toxin-resistance phenotype (Fig. 4c and Extended 
Data Fig. 7). As only one CFTR-targeting sgRNA conferred 
resistance, further analysis is required to rule out the possibility that 
this is due to an off-target effect. Similarly, the involvement of 
RAB2A in diphtheria toxin toxicity was supported by the effect of 
multiple RAB2A-targeting sgRNAs in HeLa_OC-SC cells (Extended 
Data Fig. 8). 

To assess further the candidate genes isolated from our library 
screening and the above validation procedure, we chose the PECR gene 
for further extensive study. PECR gene knockouts were isolated through 
the CRISPR/Cas9 system using two different sgRNAs, and the complete 
disruption of PECR was confirmed by Sanger sequencing (Extended 
Data Fig. 9a, b). The complete loss of PECR in both independently 
acquired clones conferred on HeLa cells increased levels of resistance 
to PA/LFnDTA, but not to diphtheria toxin (Fig. 4d and Extended Data 
Fig. 9c). PECR-specific short hairpin RNAs (shRNAs) also generated a 
resistant phenotype in HeLa cells to PA-mediated killing (Extended 
Data Fig. 9d), further demonstrating the positive role of PECR in 
anthrax toxicity. 

In addition to ANTXR1, we also designed three sgRNAs targeting 
the other anthrax toxin receptor gene, ANTXR2 (ref. 19). Only two 
sgRNAs (sgRNA2 and sgRNA3) were retained in the library. Although 
sgRNA2^ANTXR2 induced indels at an efficiency of 21.3% by T7E1 assay, 
neither sgRNA generated knockout cells that survived the PA/LFnDTA 
toxin treatment (Supplementary Tables 3 and 4). Using two pairs of 
ANTXR1- and ANTXR2-targeting TALENs constructs (ref. 18), we introduced 
each pair of plasmids into HeLa and HEK293T cells. From randomly 
picked clones harbouring either of the two pairs of TALENs constructs, 
toxin-resistant clones were identified only from the HeLa TALENs^ANTXR1 
(19 of 50) and the HEK293T TALENs^ANTXR2 (5 of 32), but not from the 
HeLa TALENs^ANTXR2 (0 of 50) or the HEK293T TALENs^ANTXR1 (0 of 
32) cells (Extended Data Fig. 10a), suggesting that these two receptors 
have different roles in these two cell types. Reverse transcription PCR 
analysis revealed that the basal expression of ANTXR2 transcripts 
was barely detectable in HeLa cells (Extended Data Fig. 10b). This 
explained why ANTXR1, but not ANTXR2, has such a dominant role 
in HeLa cells. As a more potent receptor”, ANTXR2 instead of ANTXR1 
has a more significant role when the expression of both genes was com- 
parable in HEK293T cells (Extended Data Fig. 10). 

Although arrayed and pooled screens using RNA interference 
(RNAi) libraries have already been developed and are widely used for 
systematic genetic studies in mammalian cells, they do have certain 
limitations—in particular, the RNAi-based downregulation of a par- 
ticular gene is not always sufficient to cause the phenotypic change of 
interest”’. Therefore, methods based on gene knockout screening are 
highly desirable. Here, we established an effective method for the construction 
of a pooled gene-knockout library and demonstrated that this 
CRISPR-based strategy can be seamlessly implemented in functional 
genomics when combined with deep-sequencing analysis. The broad 
application of this scalable genetic tool may further augment the power 
of the CRISPR/Cas system in studying gene functions in a high- 
throughput fashion. 

While our manuscript was under revision, two parallel studies (refs 8, 9) 
reported a similar lentiviral-based pooled screening approach using the CRISPR/ 
Cas9 system, both of which developed large libraries covering over 
18,000 (ref. 9) and 7,000 (ref. 8) genes. Genome-scale screening 
of a CRISPR/Cas9 library is valuable; however, it might be technically 
challenging to manage such a library. Our approach presents an alternative 
way (that is, via a focused library) to conduct functional genomics, 
and is particularly advantageous for knowledge-based screening. The 
other difference is that ref. 9 chose to combine Cas9 and sgRNA expression 
into a single vector, as it is time-consuming to generate 
cell lines that express Cas9. However, we found that individual clones 
harbouring Cas9 varied in their efficiency in generating sgRNA-mediated 
indels. It is therefore important to select the best single clone for library 
construction for certain cell types. In addition, the uniform background 
of Cas9 derived from a single clone is beneficial for the accurate assessment 
of the sgRNA contribution manifested by the final counts from the 
deep-sequencing analysis. 

Figure 4 | Genetic validation of candidate genes. a, b, d, Effects of ANTXR1 
(a), HBEGF (b) and PECR (d) deficiency on PA/LFnDTA- and diphtheria- 
toxin-triggered cytotoxicity in HeLa cells. The cells were treated with different 
doses of toxins, and the XTT cell viability assay and LDH lethality assay were 
performed 48 h after toxin treatment. Data are presented as the mean ± s.d. 
(n = 3). c, Relative cell viability for HeLa_OC-SC cells transfected with indicated 
sgRNA constructs. Four days after transfection, cells were incubated with 


METHODS SUMMARY 


The lentiviral plasmid expressing OCT1 and Cas9 was constructed using Gibson’s 
method” in pLenti-CMV-BSD, and the lentiviral backbone used for the sgRNA 
library cloning was constructed in the pLL3.7 vector (Addgene, Inc.). The oligo- 
nucleotides used for the sgRNA library construction were individually designed 
(Supplementary Table 2) and synthesized (Ruibiotech, Inc.). HeLa_OC-SC cells 
infected by the library viruses were sorted by FACS (MOFLO, Cytomation) based 
on the presence of EGFP. The library cells were screened by three rounds of 
toxin treatment: PA (70 ng ml⁻¹) plus LFnDTA (50 ng ml⁻¹) and diphtheria toxin 
(7.5 ng ml⁻¹). Three replicates were arranged for each toxin screening. The genomic 
DNA of each replicate as well as the original library was isolated using the 
DNeasy Blood and Tissue kit (Qiagen). sgRNA-coding regions integrated into the 
chromosomes were PCR-amplified (TransTaq DNA Polymerase High Fidelity, 
TransGen) through 26 cycles of reaction with primers annealed to the flanking 
PA (70 ng ml⁻¹) plus LFnDTA (50 ng ml⁻¹) for 48 h. Images of toxin-treated 
and untreated cells were taken (Extended Data Fig. 7) for cell counting with the 
ImageJ program. The cell viability was calculated as the percentage of surviving 
cells after toxin treatment. The data are the mean ± s.d. (n = 6); **P < 0.01; 
***P < 0.001; NS, not significant; Dunnett's multiple comparison test, 
one-way ANOVA. 


sequences of the sgRNAs (Supplementary Table 5) before being subjected to 
high-throughput sequencing analysis (Illumina HiSeq 2500). 

PA and LFnDTA, a surrogate of LF consisting of the amino-terminal domain of 
LF fused to the catalytic subunit of diphtheria toxin, were produced by using plasmid 
pET-22b-PA and pET-15b-LFnDTA (Addgene plasmid 11079 and 11075), respect- 
ively. The diphtheria toxin was purchased from List Biological Laboratories, Inc. 
Cytotoxicity assays were performed as described™ with XTT (Roche) according to 
the product manual. LDH staining and detection were performed as described in 
the product instructions (CytoTox 96, Promega). 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS 
Cells and reagents. HT1080, HeLa and HEK293T cells were maintained in 
Dulbecco’s modified Eagle’s medium (DMEM, Invitrogen) supplemented with 
10% fetal bovine serum (FBS, Invitrogen) with 5% CO2 at 37 °C. 
Plasmid construction. The lentiviral sgRNA vector (pLenti-sgRNA-Lib) was 
constructed by replacing the original U6 promoter in pLL3.7 (Addgene, Inc.) 
with the human U6 promoter, ccdB cassette and sgRNA scaffold. The gene OCT1 
was PCR-amplified from a human cDNA library (Abclonal, Inc.) and fused with a 
2A-linked (ref. 25) humanized Cas9 gene (ref. 26), followed by an IRES (pIRES, Invitrogen)- 
linked blasticidin-resistance gene. The construction of this plasmid (pLenti- 
OC-IRES-BSD) was accomplished using Gibson’s method””’, and was verified by 
sequencing analysis. Details regarding these two constructs are given in Supplemen- 
tary Fig. 1. 
Construction of the CRISPR/Cas9 sgRNA library. We created a library target- 
ing 291 human genes based on our previous shRNAmir library screening results 
(our own unpublished data). We have subjectively added additional genes based 
on previous knowledge and our research interests. We preferentially selected genes 
encoding cell surface proteins, and those important for endocytosis, trafficking and 
cell death. The oligonucleotides for each sgRNA-coding sequence for the library 
were individually designed (Supplementary Table 2) and synthesized (Ruibiotech, 
Inc.). Paired oligonucleotides were mixed in 96-well plates to a final concentration 
of 9 µM in 50 µl of TransTaq HiFi Buffer II (1×) for annealing. These annealed 
oligonucleotide pairs were then mixed, phosphorylated and ligated into the lenti- 
viral sgRNA vector using the Golden Gate method. The ligation mixture was trans- 
formed into Trans1-T1 competent cells (Transgen) to obtain the library plasmids. 
Lentiviruses of the sgRNA library were obtained by co-transfection of library 
plasmids with two viral packaging plasmids psPAX2 and pMD2.G (from Didier 
Trono of EPFL) into HEK293T using the polyethylenimine (PEI) method. HeLa 
(OCT1-Cas9) cells were infected by the library viruses, followed by FACS (MOFLO, 
Cytomation) for EGFP-positive cells, 48 h after infection. The viral titre used was 
0.05, and the total cell number for viral infection was 4 X 10”. 
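
The choice of a low MOI can be motivated with a Poisson model of viral integration, a standard approximation that the text does not spell out. The sketch below assumes that the number of integrations per cell follows a Poisson distribution with mean equal to the MOI. The library size (869 sgRNAs), the MOI (0.05) and the roughly 1,000-fold coverage come from the text; the function names are illustrative, and the total cell number is treated as an unknown because its exponent is not legible in this copy.

```python
import math


def infected_fraction(moi):
    """Fraction of cells with at least one integration (Poisson assumption)."""
    return 1.0 - math.exp(-moi)


def single_copy_fraction(moi):
    """Among infected cells, the fraction carrying exactly one integration."""
    return moi * math.exp(-moi) / infected_fraction(moi)


def cells_needed(n_guides, coverage, moi):
    """Cells to expose to virus so that infected cells cover each sgRNA `coverage` times."""
    return math.ceil(n_guides * coverage / infected_fraction(moi))


moi = 0.05
print(f"infected cells: {infected_fraction(moi):.2%}")
print(f"single-copy among infected: {single_copy_fraction(moi):.2%}")
print(f"cells needed for 1,000-fold coverage: {cells_needed(869, 1000, moi):,}")
```

Under this model only about 5% of cells are infected, but about 97.5% of those carry a single sgRNA, which is what lets each surviving cell be attributed to one knockout. Reaching 1,000-fold coverage of 869 sgRNAs then requires on the order of 1.8 × 10⁷ cells at the infection step.
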
CRISPR/Cas9 sgRNA library screening. A total of 3 × 10⁷ sgRNA library cells 
were plated onto 100-mm Petri dishes at 1 × 10⁶ cells per plate. After 24 h, the cells 
were treated with toxins at an appropriate concentration: PA (70 ng ml⁻¹) plus 
LFnDTA (50 ng ml⁻¹), and diphtheria toxin (7.5 ng ml⁻¹). For each toxin screen, 
five plates of cells were grouped together as one replicate, and three replicates 
were arranged for each toxin screen. For each round of screening, the library cells 
were incubated with the appropriate toxins for 2 days before being changed to 
fresh DMEM. The surviving cells were re-seeded for growth and subjected to another 
round of toxin screening. After three rounds of toxin treatment, the resistant clones 
and the original untreated library cells were collected separately for genomic DNA 
extraction, followed by PCR amplification of the sgRNA-coding region and deep- 
sequencing analysis. 


Identification of candidate sgRNA sequences. The genomic DNA of six repli- 
cates was isolated from 5 X 10° cells using the DNeasy Blood and Tissue kit (Qiagen). 
sgRNA-coding regions integrated into the chromosomes were PCR-amplified 
(TransTaq DNA Polymerase High Fidelity, TransGen) with 26 cycles of reaction 
using primers annealed to the flanking sequences of the sgRNAs (Supplementary 
Table 4). In this work, 0.5 µg genomic DNA was used in each 50-µl PCR reaction, 
16 PCR reactions were performed for each replicate, and different replicates were 
indexed with different barcodes. The PCR products (16 tubes) of each replicate 
were pooled and purified with DNA Clean & Concentrator-5 (Zymo Research 
Corporation), followed by high-throughput sequencing analysis (Illumina HiSeq 
2500). 
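
As a back-of-the-envelope check on template sampling, the DNA input above can be converted into genome equivalents. The sketch assumes about 6.6 pg of DNA per diploid human genome, an approximate textbook value that is not taken from this paper; HeLa cells carry more DNA per cell, so the true number of cell equivalents would be lower. It also assumes roughly one integrated sgRNA cassette per cell, as expected at low MOI.

```python
# Approximate mass of one diploid human genome in picograms (assumption).
PG_PER_DIPLOID_GENOME = 6.6


def genome_equivalents(dna_ug, pg_per_genome=PG_PER_DIPLOID_GENOME):
    """Convert a DNA mass in micrograms into a number of genome copies."""
    return dna_ug * 1e6 / pg_per_genome  # 1 µg = 1e6 pg


total_dna_ug = 0.5 * 16  # 0.5 µg per reaction, 16 reactions per replicate
genomes = genome_equivalents(total_dna_ug)
per_guide = genomes / 869  # 869 designed sgRNAs in the library
print(f"{genomes:.3g} genome equivalents, about {per_guide:.0f} per sgRNA")
```

On these assumptions the 16 pooled reactions sample each sgRNA at a depth of the same order as the roughly 1,000-fold coverage used to build the library.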

Statistical analysis. The R software package from Bioconductor, DESeq2, was 
used to perform a statistical analysis of the sequencing data. The enrichment of sgRNA 
was ranked by the average fold change of normalized counts: reads(experiment)/reads(control). 
The adjusted P value was calculated to evaluate the data quality. The criterion for 
the selection of candidate sgRNAs and their targeted genes is an adjusted P = 0.05 
for which the fold changes are greater than that of the negative control plus its 
standard deviation (mean + s.d.). sgRNAs targeting ANTXR1 and HBEGF were 
chosen to serve as negative controls for the diphtheria toxin and PA/LFnDTA screen- 
ing, respectively. 
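
The selection rule above can be written out as a short filter. This is a sketch under stated assumptions: the adjusted P values and fold changes are taken as already computed (for example by DESeq2), the cutoff is applied as adjusted P ≤ 0.05 because the comparison sign is not legible in this copy, and the sample standard deviation is used for the negative-control spread. All names and numbers are hypothetical.

```python
from statistics import mean, stdev


def select_candidates(guides, negative_control_fold_changes, alpha=0.05):
    """Return names of sgRNAs passing both the P-value and fold-change filters.

    guides: iterable of (name, fold_change, adjusted_p) tuples.
    """
    # Threshold = mean fold change of the negative-control sgRNAs plus one s.d.
    threshold = mean(negative_control_fold_changes) + stdev(negative_control_fold_changes)
    return [name for name, fold_change, adjusted_p in guides
            if adjusted_p <= alpha and fold_change > threshold]


# Hypothetical values for illustration only.
negative_controls = [1.0, 1.2, 0.8]
guides = [
    ("guide_A", 5.0, 0.01),   # enriched and significant
    ("guide_B", 5.0, 0.20),   # enriched but not significant
    ("guide_C", 1.1, 0.001),  # significant but below the control threshold
]
print(select_candidates(guides, negative_controls))
```

Using sgRNAs against the receptor of the other toxin as negative controls gives a threshold that reflects non-specific survival in the same experiment.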

Cytotoxicity assay. PA and LFnDTA, a surrogate of LF consisting of the N- 
terminal domain of LF fused to the catalytic subunit of diphtheria toxin, were 
produced using plasmid pET-22b-PA and pET-15b-LFnDTA (Addgene plasmid 
11079 and 11075), respectively. Diphtheria toxin was purchased from List Bio- 
logical Laboratories, Inc. Cytotoxicity assays were performed as described” with 
XTT (Roche) according to the product manual. LDH staining and detection were 
performed as described in the product instructions (CytoTox 96, Promega). The 
death signal, represented by the amount of LDH released, was normalized to the 
maximum LDH activity measured in wells of totally lysed cells. Each data point 
and related error bar shown in the figures for the XTT or LDH assays represent the 
average results from three replicates. 
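
The LDH normalization can be sketched as follows. The optional background subtraction and the use of the sample standard deviation are assumptions added for illustration; the text states only that the signal is normalized to the maximum LDH activity of totally lysed cells and that three replicates are averaged. The absorbance values are hypothetical.

```python
from statistics import mean, stdev


def percent_ldh_release(sample_od, max_lysis_od, background_od=0.0):
    """LDH release as a percentage of the signal from totally lysed cells."""
    return 100.0 * (sample_od - background_od) / (max_lysis_od - background_od)


def summarize(replicate_ods, max_lysis_od, background_od=0.0):
    """Return (mean, s.d.) of percentage LDH release across replicate wells."""
    values = [percent_ldh_release(od, max_lysis_od, background_od)
              for od in replicate_ods]
    return mean(values), stdev(values)


# Hypothetical absorbance readings from three replicate wells.
average, spread = summarize([0.4, 0.5, 0.6], max_lysis_od=1.0)
print(f"LDH release: {average:.1f} ± {spread:.1f}%")
```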

Real-time PCR. RNA of cultured cells was extracted using EasyPure RNA kit 
(Transgen, ER101-01), and the cDNAs were synthesized using PrimeScript 1st 
Strand cDNA Synthesis kit (TAKARA, 6110A). Real-time PCR was performed 
with Platinum SYBR green qPCR SuperMix-UDG (Life Technologies, C11733- 
038) on Stratagene Mx3005P qPCR system. Two pairs of primers were designed to 
examine the expression level of each gene. β-actin and GAPDH transcript levels 
were measured as internal controls. 
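
One common way to turn such measurements into relative expression levels is the comparative Ct (2^−ΔΔCt) calculation, shown below. The text does not state which quantification method was used, so this is an assumed illustration; it also assumes close to 100% amplification efficiency for both primer pairs. The Ct values are hypothetical.

```python
def relative_expression(ct_target, ct_reference, ct_target_cal, ct_reference_cal):
    """Comparative Ct: expression of a target gene relative to a calibrator sample.

    Each sample is first normalized to an internal control (for example β-actin).
    """
    delta_sample = ct_target - ct_reference
    delta_calibrator = ct_target_cal - ct_reference_cal
    return 2.0 ** -(delta_sample - delta_calibrator)


# Hypothetical Ct values: the target crosses threshold one cycle later than in
# the calibrator sample, with identical internal-control Ct values.
print(relative_expression(23.0, 18.0, 22.0, 18.0))
```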


25. Kim, J. H. et al. High cleavage efficiency of a 2A peptide derived from porcine 
teschovirus-1 in human cell lines, zebrafish and mice. PLoS ONE 6, e18556 
(2011). 

26. Chang, N. et al. Genome editing with RNA-guided Cas9 nuclease in Zebrafish 
embryos. Cell Res. 23, 465-472 (2013). 
Extended Data Figure 1 | Relative expression level of Cas9 by real-time 
PCR. Two pairs of primers (a, 5'-GGGGGACAGTCTTCACGAGC-3' and 
5'-CACGTACATGTCCCTGCCGT-3'; b, 5'-GCACGTGGCCCAAATTCTCG-3' 
and 5'-AGCGGTGGCCTTGCCTATTT-3') were designed to test the Cas9 
expression level. cDNA of HEK293T serves as the negative control. β-Actin was 
used as the internal control and the Cas9 expression of HEK293T_OC was set as 
the calibrator. The data are the mean ± s.d. (n = 3). 
Extended Data Figure 2 | Images of cells after three rounds of toxin 
treatment: diphtheria toxin (7.5 ng ml⁻¹) for 60 h or PA/LFnDTA 
(70 ng ml⁻¹/50 ng ml⁻¹) for 48 h. The fluorescence signals were 
superimposed onto white light images. Scale bar: 100 µm. 
Extended Data Figure 3 | PCR amplification of the sgRNA region from 
the sgRNA library for deep-sequencing analysis, as indicated by 
electrophoresis. The genomic DNAs isolated from the indicated cells were 
used as templates for PCR amplification. 
Extended Data Figure 4 | Scatter plots of log2 normalized read counts 
between two sgRNA libraries. a, Scatter plots of log2 normalized read counts 
between two of the three replicated sgRNA libraries. b, Scatter plots of log2 
normalized read counts between two sgRNA libraries with or without 
prolonged culturing (14 days), and three replicates of the original library were 
used for the comparison. 
Extended Data Figure 5 | Effect of ANTXR1 deficiency on PA/LFnDTA- 
and diphtheria-toxin-triggered cytotoxicity in HeLa cells. a, Partial 
sequences of the ANTXR1 gene in the genome containing TALENs binding regions 
(underlined) and the mutated alleles from 10 randomly selected TALENs 
clones (in HeLa cells). The tall letter indicates insertion, and the dashes indicate 
deletions in this and other figures. b, DNA sequencing analysis of mutated 
alleles in the human ANTXR1 locus. Partial coding sequences of targeted genes 
in the genome containing the CRISPR/Cas9 sgRNA3^ANTXR1 binding region 
(underlined) and sequence analysis of the mutated alleles from two 
independent clones are shown. Primers used for the PCR reactions are listed in 
Supplementary Table 5, and the shaded nucleotides represent the PAM 
sequences that guide the Cas9 for DNA recognition and cleavage in this and 
other figures. c, d, Susceptibility assay of two HeLa ANTXR1−/− clones on PA/ 
LFnDTA- and diphtheria-toxin-triggered cytotoxicity. The cells were treated 
with different doses of toxins, and the XTT cell viability assay and LDH lethality 
assay were performed 48 h after toxin treatment. Data are presented as the 
mean ± s.d. (n = 3). 
Extended Data Figure 6 | Effect of HBEGF deficiency on diphtheria-toxin- 
triggered cytotoxicity in HeLa cells. a, DNA sequencing analysis of mutated 
alleles in the human HBEGF locus. Partial coding sequences of targeted genes in 
the genome containing the CRISPR/Cas9 sgRNA3^HBEGF binding region 
(underlined) and sequencing analysis of the mutated alleles from two 
independent clones. b, Susceptibility assay of HeLa HBEGF−/− clone 2 on 
diphtheria-toxin-triggered cytotoxicity. The cells were treated with different 
doses of toxins, and the XTT cell viability assay and LDH lethality assay were 
performed 48 h after toxin treatment. Data are presented as the mean ± s.d. 
(n = 3). 
Extended Data Figure 7 | Light microscopic images of HeLa_OC-SC cells 
transfected with indicated sgRNA constructs. Four days after transfection, 
cells were incubated with PA (70 ng ml⁻¹) plus LFnDTA (50 ng ml⁻¹) for 48 h. 
The representative images for every condition (six images for each) are shown. 
Scale bar, 200 µm. 
Extended Data Figure 8 | Effects of RAB2A sgRNAs on diphtheria-toxin- 
triggered cytotoxicity in HeLa cells. HeLa_OC-SC cells were transfected with 
indicated sgRNA constructs. Four days after transfection, the LDH lethality 
assay (a) and the XTT cell viability assay (b) were performed 48 h after toxin 
treatment. HBEGF_sg3, labelled in red, was used as the positive control. The 
data are the mean ± s.d. (n = 3); *P < 0.05; **P < 0.01; ***P < 0.001; NS, not 
significant; Dunnett's multiple comparison test, one-way ANOVA. 
Extended Data Figure 9 | Effect of PECR deficiency on PA/LFnDTA- and 
diphtheria-toxin-triggered cytotoxicity in HeLa cells. a, b, DNA sequencing 
analysis of mutated alleles in the human PECR locus. Partial coding sequences 
of targeted genes in the genome containing the CRISPR/Cas9 sgRNA1^PECR 
(a) and sgRNA2^PECR (b) binding region (underlined) and sequencing analysis 
of the mutated alleles from two independent clones are shown. c, Susceptibility 
assay of the HeLa PECR−/−_sg2 clone on PA/LFnDTA- and diphtheria-toxin- 
triggered cytotoxicity. The cells were treated with different doses of toxin, and 
the LDH lethality assay was performed 48 h after toxin treatment. Data are 
presented as the mean ± s.d. (n = 3). d, Effect of a PECR-targeting shRNA smart 
pool on the sensitivity of HeLa cells to PA/LFnDTA toxicity. Three different shRNAs 
(TRCN0000046538: 
5'-CCGGGCAGTGGAATACGGATCAATTCTCGAGAATTGATCCGTATTCCACTGCTTTTTG-3', 
TRCN0000046539: 
5'-CCGGCCTCACCAAATCTTTAGCTTTCTCGAGAAAGCTAAAGATTTGGTGAGGTTTTTG-3', 
and TRCN0000046541: 
5'-CCGGCCCGCTAAACGAATTGGTGTTCTCGAGAACACCAATTCGTTTAGCGGGTTTTTG-3') 
from the MISSION shRNA library (Sigma, Inc.) 
were chosen, and the MISSION shRNA control vector (SHC002) was used, 
which contains a sequence that engages with RISC without targeting any 
known mammalian gene. After transient transfection, 1.5 µg ml⁻¹ 
puromycin was added to obtain the stable shRNA expression cells. 
The cells were treated with PA (40 ng ml⁻¹) plus LFnDTA (50 ng ml⁻¹), 
and the XTT cell viability assay was performed 48 h after toxin treatment. 
Data are presented as the mean ± s.d., n = 3, **P < 0.01; t-test. 
Extended Data Figure 10 | Effects of ANTXR1 and ANTXR2 gene
expression on the susceptibility of HeLa and HEK293T cells to PA/
LFnDTA. a, Effects of ANTXR1 and ANTXR2 gene knockout on the
susceptibility of the indicated cells to the treatment of PA and LFnDTA. Details
of the TALENs constructs used for gene targeting have been previously
described’’. Cell clones transfected with TALENs were randomly picked and
cultured with PA (100 ng ml⁻¹) and LFnDTA (50 ng ml⁻¹) for 3 days to
determine their sensitivity to toxins. b, Level of ANTXR1 and ANTXR2
transcripts in HeLa and HEK293T cells was measured by RT-PCR, and β-actin
and GAPDH were included as internal controls. To ensure accuracy, two pairs
of primers were used to measure ANTXR1 and ANTXR2 transcript levels,
respectively. The primers used in PCR reactions were: for ANTXR1, primer pair
1 5'-GGAAACGGCTTCCGACATGC-3' and 5'-TGTTGTGGGGCCTGATGCAA-3',
primer pair 2 5'-GCCCGGATTGCGGACAGTAA-3' and
5'-GCTGACCTGGAGTGCAGCTT-3'; for ANTXR2, primer pair 1
5'-TTGGACGGTCTGGTGCCATC-3' and 5'-TCCTCCCCCACACAGACACT-3',
and primer pair 2 5'-ATGTGGTGGTTTTGGCCCCT-3' and
5'-GTGTGGGTTTGGGTCGAGGT-3'; for β-actin, 5'-AGAGCTACGAGCTGCCTGAC-3'
and 5'-AGCACTGTGTTGGCGTACAG-3'; for
GAPDH, 5'-ACCACAGTCCATGCCATCAC-3' and 5'-TCCACCACCCTGTTGCTGTA-3'.
Copper is required for oncogenic BRAF signalling and tumorigenesis

Donita C. Brady, Matthew S. Crowe, Michelle L. Turski, G. Aaron Hobbs, Xiaojie Yao, Apirat Chaikuad, Stefan Knapp,
Kunhong Xiao, Sharon L. Campbell, Dennis J. Thiele & Christopher M. Counter


The BRAF kinase is mutated, typically Val 600→Glu (V600E), to
induce an active oncogenic state in a large fraction of melanomas,
thyroid cancers, hairy cell leukaemias and, to a smaller extent, a wide
spectrum of other cancers¹,². BRAF^V600E phosphorylates and activates
the MEK1 and MEK2 kinases, which in turn phosphorylate and activate
the ERK1 and ERK2 kinases, stimulating the mitogen-activated
protein kinase (MAPK) pathway to promote cancer³. Targeting MEK1/2
is proving to be an important therapeutic strategy, given that a MEK1/2
inhibitor provides a survival advantage in metastatic melanoma⁴, an
effect that is increased when administered together with a BRAF^V600E
inhibitor⁵. We previously found that copper (Cu) influx enhances
MEK1 phosphorylation of ERK1/2 through a Cu–MEK1 interaction⁶.
Here we show that decreasing the levels of CTR1 (Cu transporter 1), or
mutations in MEK1 that disrupt Cu binding, decreased BRAF^V600E-driven
signalling and tumorigenesis in mice and human cell settings.
Conversely, a MEK1–MEK5 chimaera that phosphorylated ERK1/2
independently of Cu or an active ERK2 restored the tumour growth
of murine cells lacking Ctr1. Cu chelators used in the treatment of
Wilson disease⁷ decreased tumour growth of human or murine cells
transformed by BRAF^V600E or engineered to be resistant to BRAF
inhibition. Taken together, these results suggest that Cu-chelation
therapy could be repurposed to treat cancers containing the BRAF^V600E
mutation.

Decreasing Ctr1 expression suppresses MAPK phenotypes in Drosophila
and signalling in mammalian cells. In a cupric setting MEK1 also binds
Cu, and Cu chelation decreases MEK1/2 kinase activity®. Cu and, to a
smaller extent, silver (which is isoelectronic to cuprous Cu) increased the
phosphorylation of ERK2 by MEK1, whereas other tested metals had
no effect (Extended Data Fig. 1a). Given these results and the dependence
of BRAF mutation-positive cancers on MEK1/2 for tumour growth*,
we investigated whether decreasing the influx of Cu affects BRAF^V600E
oncogenesis. BRAF^V600E was expressed in Ctr1+/+ and Ctr1−/− mouse
embryonic fibroblasts (MEFs)’, and intracellular Cu deficiency of the
latter was confirmed by increased copper chaperone for superoxide
dismutase (CCS) levels’® (Fig. 1a, b). The BRAF^V600E-transformed Ctr1−/−
MEFs showed decreased phosphorylated ERK1/2 (P-ERK1/2), cell
growth, and tumour kinetics, effects that were rescued by expressing
CTR1 but not the transport-defective mutant’! CTR1^M154A (Fig. 1a–d and
Extended Data Figs 2a–c and 10a, b). Thus, the Cu-transport function of
CTR1 is required for robust BRAF^V600E signalling and tumorigenesis.

To assess whether decreasing the binding of Cu to MEK1 affects
BRAF^V600E-driven tumorigenesis, targeted mutagenesis was undertaken,
which revealed that M187A, H188A, M230A, H239A and one other
mutation decreased the ability of MEK1 to bind a Cu-charged resin and
phosphorylate ERK1/2 (Extended Data Fig. 3a–c). A metal-catalysed
oxidation reaction followed by mass spectrometry identified oxidation
at H188, M230, H239 and two other sites (Fig. 1e and Extended Data
Fig. 4), suggesting that these residues reside within 10 Å of a Cu atom”.
We thus focused on H188, M230 and H239 and included M187 as well
because, although the oxidation status of this site could not be determined, it
lies adjacent to H188 and was similarly required for Cu binding and
kinase activity (Fig. 1e and Extended Data Figs 3c and 4e). These four
residues are conserved in MEK2 (Extended Data Fig. 5), which, like
MEK1, also bound a Cu-charged resin and was inhibited by
tetrathiomolybdate (TTM), a Cu chelator (Extended Data Fig. 1b, c). In the
three-dimensional MEK1 structure”’, these four amino acids also
clustered such that each was no more than 12.5 Å from the next (Fig. 1e).
Combined mutations at these sites progressively decreased MEK1 kinase
activity and affinity for a Cu-charged resin, with mutations at all four
sites (copper-binding mutant; CBM) having the largest defect in kinase
activity, even in the presence of a constitutively active mutation (S218D/
S222D; DD)" or excess Cu (Fig. 1f–j). Nevertheless, MEK1^CBM was still
phosphorylated in BRAF^V600E-transformed cells, underwent cooperative
unfolding and possessed similar thermostability to that of MEK1,
as assessed by both circular dichroism and differential scanning
fluorimetry (Fig. 1k, l and Extended Data Figs 6a–d and 10c). To exclude the
possibility that the CBM mutation decreased kinase activity
independently of Cu binding, we took advantage of the fact that MEK5 is highly
similar to MEK1/2, yet lacks two sites that are important for binding
Cu (Extended Data Fig. 5) and did not bind the Cu-charged resin or
show changes in kinase activity in the presence of Cu or TTM (Fig. 1m, n).
Specifically, introducing a CBM-equivalent (CBM-E) mutation into
MEK5 was found to have no overt effect on the ability of MEK5 to
phosphorylate substrates myelin basic protein (MBP) or ERK5 in vitro or
in cells (Fig. 1o, p). Given these results, we tested and found that the
decrease in P-ERK1/2 and tumour growth of BRAF^V600E-transformed
Ctr1+/+ MEFs due to knockdown of endogenous Mek1 messenger
RNA by short hairpin RNA (shRNA) (Fig. 1k) was rescued by
expressing wild-type but not CBM mutant MEK1 resistant to shRNA-mediated
knockdown (Fig. 1l, q and Extended Data Fig. 10c). Thus, under normal
Cu homeostasis, inhibiting Cu binding in MEK1 retards BRAF^V600E-driven
signalling and tumorigenesis.

We next tested whether bypassing the requirement of MEK1/2 for
Cu restores BRAF^V600E-driven signalling and tumorigenesis to Ctr1−/−
MEFs. MEK1 was engineered to be independent of Cu by fusing the
ERK1/2-binding region of MEK1 protein’* to the kinase domain of
MEK5 protein, rendered active (S311D/T315D; DD)'* because MEK5
is a substrate of MEKK2 and MEKK3 (ref. 17). Expressing MEK1–MEK5^DD
resulted in high levels of P-ERK1/2 in Ctr1−/− MEFs (Fig. 1r, s), in
contrast to similarly activated MEK1^DD, which remained sensitive to TTM
(Fig. 1j). Furthermore, MEK1–MEK5^DD restored tumour growth to these
Ctr1−/− MEFs (Fig. 1t and Extended Data Fig. 10d). Similarly,
expressing ERK2^R67S, which increases ERK2 autophosphorylation in a
MEK1/2-independent fashion’’, ERK2^D321N, which renders the kinase
insensitive to phosphatases”, or ERK2^GOF, which combines these two
mutations, also led to robust ERK1/2 phosphorylation and restored tumour
Figure 1 | Binding of Cu to MEK1 promotes MAPK signalling and
tumorigenesis by oncogenic BRAF. RT-PCR (a, k, r, u) and immunoblot
(b, l, p, s, v) detection of the indicated endogenous, ectopic (ect), or both
(end/ect) mRNA and phosphorylated (P) and/or total (T) proteins from cells.
IP, immunoprecipitated; WT, wild type. c, d, q, t, w, Mean tumour volume
(± s.e.m.) versus time of mice injected with BRAF^V600E-transformed Ctr1+/+
(black circles) or Ctr1−/− (orange squares) MEFs (n = 4) (c), BRAF^V600E-transformed
Ctr1−/− MEFs expressing no transgene (orange squares), CTR1
(black diamonds) or CTR1^M154A (blue open circles) (n = 3) (d), BRAF^V600E-transformed
Ctr1+/+ MEFs expressing scramble shRNA (black circles), Mek1
shRNA alone (orange squares) or with RNA-mediated-interference-resistant
MEK1 (green open triangles) or MEK1^CBM (blue open circles) (n = 3)
(q), Ctr1−/− MEFs expressing BRAF^V600E (orange squares, n = 3) or
MEK1–MEK5^DD (black open squares, n = 4) (t), or Ctr1−/− MEFs expressing
BRAF^V600E (orange squares, n = 3), ERK2^GOF (black diamonds, n = 3),
ERK2^R67S (yellow open triangles, n = 4) or ERK2^D321N (green open triangles,
n = 4) (w). Results were compared using a one-tailed unpaired t-test (c, t) or a
one-way ANOVA followed by a Tukey's multi-comparison post test
(d, q, w). Two asterisks, P < 0.01; four asterisks, P < 0.0001. e, MEK1 structure
(PDB 3EQD) denoting amino acids M187, H188, M230 and H239 and the
intervening space (Å)3, f–j, m–o, Immunoblot detection of the indicated
recombinant proteins bound to a resin charged with or without Cu (f, m) or
phosphorylated (P) or total (T) recombinant proteins with or without 50 µM
TTM, a seven-step increase in TTM from 0 to 50 µM, or either 2.5 molar
equivalents or 2.5 µM CuSO₄ (g, h, i, j, n, o). Gel images are representative of
at least two technical replicates.
growth to the Ctr1−/− MEFs (Fig. 1u–w and Extended Data Fig. 10d).
Thus, while decreasing Cu levels will certainly have pleiotropic effects”
that could affect the MAPK pathway”, activating this pathway in a
manner independent of Cu nevertheless restores signalling and
tumorigenesis in cells deficient in Cu influx.

To investigate the relationship between Cu and endogenous onco- 
genic BRAF in tumorigenesis, CTR1 was stably knocked down by shRNA 
in BRAF mutation-positive and mutation-negative melanoma cell lines. 
Knockdown of CTR1 decreased P-ERK1/2 levels in all tested cell lines, 
Figure 2 | Knockdown of CTR1 decreases MAPK signalling and
tumorigenesis specifically by oncogenic BRAF. a–c, RT-PCR detection of
the indicated mRNAs (a), immunoblot detection of the indicated
phosphorylated (P) or total (T) proteins from cells (b), and mean tumour
volume (± s.e.m.) versus time (c) of mice injected (n = 3 for DM738; n = 4
for others) with the indicated cell lines expressing scramble (black circles) or
CTR1 (orange squares) shRNA. Results were compared using a one-tailed
unpaired t-test. Asterisk, P < 0.05; three asterisks, P < 0.001. d, e, RT-PCR
detection of the indicated endogenous or ectopic (ect) mRNA from cells (d) and
mean tumour volume (± s.e.m.) versus time (e) for mice (n = 3) injected with
CMYC^T58A-transformed Ctr1+/+ (black circles) or Ctr1−/− (orange squares)
MEFs. Gel images are representative of two technical replicates.
although only the BRAF mutation-positive cell lines showed decreased
tumour growth (Fig. 2a–c and Extended Data Fig. 10b), consistent with
a dependence of BRAF^V600E tumorigenicity on the MEK1/2 proteins®.
Knockdown of CTR1 also inhibited the tumour growth of one of two
tested NRAS mutation-positive melanoma cell lines (Extended Data
Fig. 7a, b). Finally, loss of Ctr1 had no effect on the ability of a different
oncogene, CMYC^T58A, to promote tumour growth (Fig. 2d, e). These
results suggest a specific requirement for Cu in cancers that are
particularly dependent on the MAPK pathway.

We next evaluated the requirement for Cu influx in vivo during
spontaneous cancer development. Mice harbouring floxed conditional
null or wild-type Ctr1 alleles” were crossed into a background having
one Cre-activated (CA) conditional oncogenic allele of Braf and two
floxed conditional null alleles of Trp53 (Braf^CA/+ Trp53^flox/flox; BP)23,
which on intranasal administration of adenovirus expressing Cre
recombinase (AdCre) converts Braf^CA to an oncogenic Braf^V600E allele
and Trp53^flox to null alleles, leading to the development of lung
adenocarcinomas”. AdCre was administered to cohorts of Ctr1+/+ and
Ctr1^flox/flox BP mice to induce cancer development and, in the latter case,
to convert Ctr1^flox to null alleles. Tumours arising in both cohorts
showed the appropriate recombination of the Braf^CA, Trp53^flox and
Figure 3 | Genetic ablation of Ctr1 decreases MAPK signalling and
tumorigenesis and extends the lifespan in a mouse model of Braf^V600E-driven
lung cancer. a, c, e, Representative resected (arrows indicate visible
lesions; scale bar, 1 cm) (a), haematoxylin/eosin-stained (scale bar, 1 mm)
(c) and immunohistochemical detection of P-ERK1/2 (orange, positive pixels;
scale bar, 500 µm) (e) in lungs of Ctr1+/+ or Ctr1^flox/flox BP mice (a, c, fixed
endpoint; e, moribundity endpoint). b, d, f, Box-and-whiskers plot of number
of tumours per mouse (n = 5 lungs) (b), mean percentage of area of
abnormal lung tissue (n = 5 lungs) (d) and mean percentage of P-ERK1/2
positive-staining area per lung tumour (f) of Ctr1+/+ (n = 199 tumours) and
Ctr1^flox/flox (n = 142 tumours) BP mice (b, d, fixed endpoint; f, moribundity
endpoint). g, Kaplan-Meier survival analysis of Ctr1+/+ (black line, n = 30)
and Ctr1^flox/flox (orange line, n = 30) BP mice. Results were compared using a
one-tailed unpaired t-test (b, d, f) or a Mantel-Cox test (g). Asterisk, P < 0.05;
two asterisks, P < 0.01; three asterisks, P < 0.001; four asterisks, P < 0.0001.
Ctr1^flox alleles, and targeted loss of Ctr1 did not result in weight loss
(Extended Data Fig. 8a, b). Ctr1^flox/flox BP mice showed a decrease in
the number of visible surface lesions, the area of abnormal lung tissue,
and P-ERK1/2 staining in tumours and also a 15% survival advantage
over Ctr1+/+ BP mice (Fig. 3a–g and Extended Data Fig. 10a, b). Thus,
the loss of Ctr1 retards BRAF^V600E-driven tumorigenesis, leading to a
survival advantage.

The Cu chelators D-penicillamine, trientine and TTM are efficacious,
long-term treatments for lowering systemic Cu levels in individuals
with the Cu-overload disorder Wilson disease’. Capitalizing on these
drugs, we found that TTM decreased the growth of BRAF^V600E- but
not ERK2^GOF-, MEK1–MEK5^DD- or CMYC^T58A-transformed MEFs in
soft agar. The growth of BRAF mutation-positive melanoma cell lines
in soft agar was also inhibited by TTM to various degrees in a
dose-dependent manner, whereas BRAF/NRAS mutation-negative lines were
resistant to TTM (Fig. 4a and Extended Data Fig. 10e). Oral TTM
treatments for only two weeks also decreased the size of tumours in mice
injected with either BRAF^V600E-transformed MEFs or the BRAF
mutation-positive melanoma cell line DM440, with no negative effect on mouse
weight (Fig. 4b, c and Extended Data Figs 9 and 10e). This effect could
be ascribed to a decrease in MEK1/2 kinase activity, because TTM halved
the number of P-ERK1/2-positive cells in BRAF^V600E-driven tumours
and failed to inhibit tumour growth of ERK2^GOF-transformed cells
(Fig. 4d, e). To investigate more aggressive Cu-lowering therapeutic
methods, mice injected with BRAF^V600E-transformed MEFs were
provided with a Cu-deficient (CuD) diet supplemented with Cu and orally
treated with vehicle alone for two weeks, or provided with a CuD diet
and treated for two weeks with vehicle alone, oral TTM, or oral TTM
followed by oral trientine (Fig. 4f). Tumours in mice fed with a CuD
diet trended towards being smaller, although this did not reach
significance (P = 0.055). Combining this diet with TTM blocked the
development of visible tumours during the two weeks of treatment, although
tumours quickly emerged after treatment ceased. However, mice treated
with oral trientine after TTM showed a more durable response (Fig. 4g


Figure 4 | Pharmacological chelation of Cu decreases tumour growth of
BRAF^V600E-driven and vemurafenib-resistant tumour cells. a, Normalized
average soft agar growth (± s.e.m.) of Ctr1+/+ MEFs (plated in triplicate)
expressing the indicated transgenes or the indicated cell lines treated with
vehicle (black bar), 100 nM TTM (grey bar) or 400 nM TTM (white bar). Four
asterisks, P < 0.0001. Results are representative of three experiments.
b, c, e, Mean tumour volume (± s.e.m.) versus time for tumours of mice (n = 3)
injected with BRAF^V600E-transformed Ctr1+/+ MEFs (b), DM440 cells (c)
or ERK2^GOF-transformed Ctr1−/− MEFs (e) and treated with either vehicle
(black circles) or TTM (orange squares). Asterisk, P < 0.05; two asterisks,
P < 0.01. d, Box-and-whiskers plot of percentage of P-ERK1/2 positive-stained
cells per field of tumours from mice (n = 3) injected with BRAF^V600E-transformed
Ctr1+/+ MEFs and treated with vehicle (black line) or TTM
(orange line). Asterisk, P < 0.05. f, g, Dosing and diet regimen (f) and mean
tumour volume (± s.e.m.) versus time (g) of mice (n = 3) injected with
BRAF^V600E-transformed Ctr1+/+ MEFs and provided with a copper-deficient
diet (CuD) with either deionized H₂O (diH₂O) and treated for two weeks with
vehicle (group A, black circles) or diH₂O supplemented with CuSO₄ and
treated for two weeks with vehicle (group B, blue triangles), TTM (group C,
orange squares) or TTM then trientine (group D, green open diamonds). Three
asterisks, P < 0.001. h, Immunoblot detection of the indicated mRNA and
phosphorylated (P) or total (T) recombinant proteins with or without 50 µM
TTM. i, Immunoblot detection of the indicated recombinant proteins bound
to resin charged with or without Cu. j, k, RT-PCR (j) and immunoblot
detection of indicated phosphorylated (P) or total (T) endogenous, ectopic
(ect), or both (end/ect) proteins (k) in A375 cells expressing haemagglutinin
(HA)-tagged MEK1 or HA-MEK1^C121S. l, Mean tumour volume (± s.e.m.)
versus time of mice injected with A375 cells expressing MEK1 (top) or
MEK1^C121S mutant (bottom) and treated with vehicle (black circles, n = 4),
TTM (orange squares, n = 3) or vemurafenib (blue triangles, n = 4). Results
were compared using a one-tailed unpaired t-test (b, c, d) or a one-way
ANOVA followed by a Tukey's multi-comparison post test (a, g, l). Two
asterisks, P < 0.01; three asterisks, P < 0.001. Gel images are representative of
at least two technical replicates.
and Extended Data Fig. 10e). Thus, Cu-lowering strategies used to
manage Wilson disease’ could be repurposed for the treatment of BRAF^V600E-driven
cancers.

BRAF^V600E inhibitors such as vemurafenib have a limited duration
of response as a result of acquired resistance that is often associated with
the reactivation of MEK1/2 kinases™. One reported mechanism of
resistance is the acquisition of an activating C121S mutation” in MEK1.
Although MEK1^C121S had elevated kinase activity in vitro, it still bound
to the Cu-charged resin and its kinase activity was still inhibited by
TTM (Fig. 4h, i). Moreover, although expressing MEK1^C121S in the
BRAF mutation-positive melanoma cell line A375 increased P-ERK1/2
levels and imparted resistance to vemurafenib, in contrast with the same
cells expressing MEK1, the MEK1^C121S-expressing A375 cells
nevertheless remained sensitive to TTM in vivo (Fig. 4j–l and Extended Data
Fig. 10f). Thus, Cu chelation decreases tumorigenicity driven by a
mutation that confers resistance to a BRAF^V600E inhibitor.

Although the detailed mechanism underlying the requirement of
Cu for robust MEK1/2 kinase activity remains to be explained, we have
shown that decreasing the influx, bioavailability and binding of Cu to
MEK1 decreases MEK1/2 kinase activity and oncogenic BRAF-driven
tumorigenesis. Cu chelators, which are generally safe and economical
drugs that have been given daily for decades to manage Cu levels
in patients with Wilson disease’, also decreased oncogenic BRAF- and
MEK1-driven tumorigenesis. Thus, inhibiting MEK1/2 kinase activity
with Cu chelators, perhaps in combination with other MAPK pathway
inhibitors, may merit clinical consideration for the treatment of not
only BRAF mutation-positive cancers but also cancers developing
resistance to BRAF^V600E and potentially even MEK1/2 inhibitors.


METHODS SUMMARY 


Binding to Cu-charged resin, in vitro kinase assays, metal-catalysed oxidation,
circular dichroism and differential scanning fluorimetry were performed as described
previously*”***. Standard methods were used to generate cell lines and assay RNA
and protein levels and growth in soft agar. Measurements of tumour growth in
immunocompromised mice treated with the indicated drugs and fed with the
indicated diets as well as monitoring the tumour development and lifespan of BP mice”
in a Ctr1+/+ or a Ctr1^flox/flox background” treated intranasally with AdCre”* were
performed under a protocol approved by the Institutional Animal Care and Use
Committee.


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS

Cell lines. Ctr1+/+ and Ctr1−/− MEFs (immortalized with a plasmid encoding the
SV40 T-Ag gene) were described previously’. A375 was purchased from ATCC.
DM175, DM440, DM443, DM598, DM646, DM738 and DM792 cell lines were
provided by D. S. Tyler”. Mouse lung cancer cell lines were created and cultured as
described previously*’. Ctr1+/+ and Ctr1−/− immortalized MEFs and A375, DM175,
DM646, DM440, DM598, DM738 and DM792 cell lines were stably infected with
retroviruses derived from pBABE, pWZL or pSUPER-based vectors (see plasmids
below) as described previously". Ctr1+/+ MEFs were stably transfected with pCMV-based
vectors (see plasmids below), using established protocols.

Plasmids. pBABEpuro-MYC-HIS-BRAF^V600E, pSUPER-retro-puro-tetO-RALB-scramble
(encoding RALB scramble shRNA), pBABEpuro-CMYC^T58A and
pCMV-HA-MEK1^WT were described previously*****. pBABEbleo-Flag-BRAF^V600E was
created by PCR subcloning BRAF^V600E from the pBABEpuro-MYC-HIS-BRAF^V600E
plasmid with primers designed to include an amino-terminal Flag tag.
pWZLblasti-CTR1^WT and pWZLblasti-CTR1^M154A were created by PCR subcloning CTR1^WT or
CTR1^M154A from the pCDNA3.1-CTR1^WT or pCDNA3.1-CTR1^M154A plasmids".
pGEX6P1-GST-HA-MEK1^WT (GST, glutathione S-transferase) and
pWZLblasti-HA-MEK1^WT were created by PCR subcloning MEK1^WT from the pENTRI-MEK1^WT
plasmid* with primers designed to include an N-terminal haemagglutinin tag.
pGEX6P1-GST-HA-MEK1^CBM and pWZLblasti-HA-MEK1^CBM (M187A/H188A/
M230A/H239A; CBM), pGEX6P1-GST-HA-MEK1^C121S, pWZLblasti-HA-MEK1^C121S,
pGEX6P1-GST-HA-MEK1^M187A/H188A and pGEX6P1-GST-HA-MEK1^M230A/H239A
were created by introducing mutations corresponding to the indicated amino-acid
changes by site-directed mutagenesis into MEK1^WT from the pENTRI-MEK1^WT
plasmid’ followed by PCR subcloning with primers designed to include an
N-terminal HA tag. pGEX6P1-GST-HA-MEK5^WT and pWZLblasti-HA-MEK5^WT
were created by PCR subcloning MEK5^WT from the pWZLneo-Myr-Flag-MEK5
plasmid. pGEX6P1-GST-HA-MEK5^CBM-E and pWZLblasti-HA-MEK5^CBM-E
(L280A/H281A/M323A/Q332A; CBM-E) were created by introducing mutations
corresponding to the indicated amino-acid changes by site-directed mutagenesis
into MEK5^WT from the pWZLneo-Myr-Flag-MEK5 plasmid® followed by PCR
subcloning with primers designed to include an N-terminal HA tag.
pGEX6P1-GST-HA-ERK5*** was created by introducing a mutation corresponding to the
indicated amino-acid change by site-directed mutagenesis into ERK5^WT from the
pWZLneo-Myr-Flag-ERK5 plasmid” followed by PCR subcloning with primers
designed to include an N-terminal HA tag. pWZLblasti-HA-ERK5^WT was created
by PCR subcloning ERK5^WT from the pWZLneo-Myr-Flag-ERK5 plasmid” with
primers designed to include an N-terminal HA tag. pBABEpuro-HA-MEK1-MEK5^DD
and pWZLblasti-HA-MEK1-MEK5^DD (S311D/T315D; DD) were created by
introducing mutations corresponding to the indicated amino-acid changes by site-directed
mutagenesis into MEK5^WT from the pWZLneo-Myr-Flag-MEK5 plasmid”
followed by two-step PCR to fuse MEK1 (nucleotides 1-201) to MEK5 (nucleotides
498-1344) in frame with primers to include an N-terminal HA tag.
pGEX6P1-GST-HA-ERK2^K54R, pWZLblasti-HA-ERK2^R67S, pWZLblasti-HA-ERK2^D321N,
pBABEpuro-HA-ERK2^GOF and pWZLblasti-HA-ERK2^GOF (R67S/D321N; GOF)
were created by introducing mutations corresponding to the indicated amino-acid
changes by site-directed mutagenesis into ERK2^WT from the pDONR223-ERK2^WT
plasmid’ followed by PCR subcloning with primers designed to include an
N-terminal HA tag. pSUPER-retro-puro-tetO-CTR1-shRNA was created to express
the human CTR1 shRNA target sequence 5'-AAAGCCCAGCTTTCTCTTTGG-3'.
pSUPER-retro-puro-tetO-Mek1-shRNA was created to express the mouse Mek1
shRNA target sequence 5'-GCCTCTCAGCTCATATGGAAT-3' (ref. 37).
pCMV-HA-MEK1 encoding mutants M94A, H100A, H119A, M187A, H188A, C207A,
M230A, H239A, M256A, M308A, C341A and H358A were created by site-directed
mutagenesis of MEK1^WT from the pCMV-HA-MEK1^WT plasmid.

Reverse transcriptase PCR. RNA was purified from MEFs or tumour cell lines
and reverse transcribed (RT) as described previously*’, and then PCR amplified
with the primers 5'-ATCCTCATCAGCTCCCAATG-3' and
5'-CACATCACCATGCCACTTTC-3' to detect human BRAF; 5'-CTGTTTTCCGGTTTGGTGAT-3'
and 5'-TGCCCAACAGTTTTGTGTGT-3' to detect human CTR1;
5'-ATGAACCATATGGGGATGAACCATA-3' and 5'-TCAATGGCAGTGCTCTGTGATGTC-3'
to detect mouse Ctr1; 5'-GCACAGTCAAGGCCGAGAAT-3' and
5'-GCCTTCTCCATGGTGGTGAA-3' to detect mouse Gapdh;
5'-CCTTGAGGCCTTTCTTACCC-3' and 5'-CCCACGATGTACGGAGAGTT-3' to detect human
MEK1; 5'-GTGAACTCACGTGGGGAGAT-3' and
5'-CAGGAGGAGGAATGGGGTAT-3' to detect mouse Mek1; 5'-CCTTGCAGAAGAAGCTGGAG-3' and
5'-TCGGGACATGATATGCTTTG-3' to detect human MEK1–MEK5^DD;
5'-TGATCACACAGGGTTCCTGA-3' and 5'-TGGAAAGATGGGCCTGTTAG-3' to
detect human ERK2; 5'-GAGAGACCCTCACTGCTG-3' and
5'-GATGGTACATGACAAGGTGC-3' to detect human GAPDH; and
5'-ACGAGCACAAGCTCACC-3' and 5'-TTTCCACACCTGGTTGC-3' to detect human CMYC. The fold
change in the ratio of CTR1 mRNA to total GAPDH mRNA was measured in ImageJ
software by boxing each band per representative image with the rectangular
selection tool, and calculating the total area of the band in pixels. The total area of the
CTR1 mRNA band in pixels was normalized to the total area of the total GAPDH
mRNA band in pixels. The fold change is shown in figures.
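The band quantification described above reduces to a ratio of ratios. The Python sketch below illustrates the arithmetic only: the band areas are made-up numbers, the function names are hypothetical, and expressing the result relative to a control (scramble) lane is an assumption, because the text does not state which lane serves as the reference.

```python
# Sketch of the densitometry arithmetic: a target band area is normalized to
# its loading-control band area, and the result is expressed relative to a
# control lane. All numbers are illustrative, not data from the paper.


def normalized_signal(target_area: float, reference_area: float) -> float:
    """Target band area (pixels) divided by loading-control band area."""
    if reference_area <= 0:
        raise ValueError("reference band area must be positive")
    return target_area / reference_area


def fold_change(target_area: float, reference_area: float,
                control_target_area: float,
                control_reference_area: float) -> float:
    """Normalized signal of a sample lane relative to the control lane."""
    sample = normalized_signal(target_area, reference_area)
    control = normalized_signal(control_target_area, control_reference_area)
    return sample / control


# Hypothetical areas: CTR1 and GAPDH bands in a knockdown lane and in a
# scramble-control lane.
knockdown_fold = fold_change(300.0, 1000.0, 1200.0, 1000.0)
print(f"CTR1/GAPDH fold change versus control: {knockdown_fold:.2f}")
```

The same arithmetic applies to the phosphorylated-to-total protein ratios reported for the immunoblots.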

Immunoblot analysis. Equal quantities of lysates were isolated from the indicated
cell lines after 24 h of serum starvation and then resolved by SDS-PAGE and
immunoblotted as described previously® with one of the following primary
antibodies: mouse anti-MEK1, rabbit anti-ERK2, rabbit anti-MEK2, mouse anti-MEK1/2,
rabbit anti-ERK1/2, rabbit anti-ERK5, rabbit anti-phospho(Ser 217/221)-MEK1/2,
rabbit anti-phospho(Thr 202/Tyr 204)-ERK1/2, rabbit anti-phospho(Thr 218/Tyr 220)-ERK5
(Cell Signaling Technology), mouse anti-HA (Covance), rabbit anti-CCS
(Santa Cruz Biotechnology), mouse anti-β-actin (Sigma), mouse anti-MEK5 (BD
Transduction Laboratories), rat anti-myelin basic protein (Millipore) or mouse
anti-phospho(Thr 98)-MBP (Millipore), followed by detection with one of the
following horseradish peroxidase-conjugated secondary antibodies: goat anti-rabbit
IgG, goat anti-mouse IgG (Invitrogen), goat anti-mouse light-chain-specific IgG
or mouse anti-rabbit light-chain-specific IgG (Jackson ImmunoResearch
Laboratories) using enhanced chemiluminescence (ECL; GE Healthcare) or SuperSignal
West Femto Chemiluminescent substrate (Pierce) detection reagents. The fold
change in the ratio of phosphorylated protein to total protein was measured in
ImageJ software by boxing each band per representative image with the rectangular
selection tool, and calculating the total area of the band in pixels. The total area of
the phosphorylated protein band in pixels was normalized to the total area of the
total protein band in pixels. The average fold change is shown in figures.
Immunoprecipitation. Cell lysate (500 µg) was incubated overnight with 5 µl
of mouse anti-HA antibody (Covance) at 4 °C. The immunoprecipitates were
collected with GammaBind G Sepharose beads (GE Healthcare) for 2 h at 4 °C.
SDS-PAGE analysis and immunoblotting were performed as described above.
Protein purification. Recombinant GST-ERK2, GST-MEK1, GST-ERK5 and
GST-MEK5 proteins were expressed from pGEX-based vectors (see plasmids above)
and purified as described previously’. Recombinant MEK1^WT, MEK1^DD (S218D/
S222D; DD), MEK1^CBM and MEK1^CBM/DD proteins were a gift from A. Stewart,
E. Johnson and C. Cronin. Recombinant GST-MEK2 and GST-MEK5 were
purchased from Abnova. Recombinant MBP was purchased from Millipore.

In vitro kinase assays. MEK1, MEK2 and MEK5 in vitro kinase assays were
performed as described previously’. In brief, 0.6 µg of recombinant GST-ERK2^K54R,
GST-ERK5<**® or MBP and 1.4 µg of recombinant MEK1, GST-MEK1,
GST-MEK2 or GST-MEK5, or mutants thereof, were incubated for 30 min in 180 µl of
kinase buffer in the presence or absence of 2.5 molar equivalents of CuSO₄, AgNO₃,
FeNH₄SO₄, NiSO₄ or ZnSO₄ (Fig. 1h and Extended Data Fig. 1a), a seven-step
titration of TTM (Sigma) from 0–50 µM (Fig. 1f), a fixed 50 µM concentration of TTM
(Figs 1j, n and 4g and Extended Data Fig. 1c) and/or a fixed concentration of 2.5 µM
CuSO₄ (Fig. 1n and Extended Data Fig. 1c) at 22 °C.
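To give a feel for what "2.5 molar equivalents" means at these quantities, the following Python sketch converts a protein mass to picomoles and then to the corresponding metal concentration in the stated 180 µl reaction. The molecular mass used (about 43.4 kDa for untagged MEK1) is an assumption introduced here for illustration and is not given in the text; a GST fusion would be heavier and give proportionally fewer moles. The function names are hypothetical.

```python
# Sketch: how much metal salt corresponds to 2.5 molar equivalents of kinase.
# ASSUMPTION: molecular mass of untagged MEK1 taken as ~43,400 Da; this value
# is not from the paper and should be replaced by the real construct mass.

ASSUMED_MEK1_MASS_DA = 43_400.0


def picomoles(mass_ug: float, mass_da: float) -> float:
    """Convert a protein mass in micrograms to picomoles."""
    return mass_ug * 1e-6 / mass_da * 1e12


def metal_picomoles(protein_pmol: float, equivalents: float = 2.5) -> float:
    """Picomoles of metal for a given number of molar equivalents."""
    return protein_pmol * equivalents


def micromolar(pmol: float, volume_ul: float) -> float:
    """Concentration in micromolar (pmol per microlitre equals µM)."""
    return pmol / volume_ul


kinase_pmol = picomoles(1.4, ASSUMED_MEK1_MASS_DA)   # 1.4 µg of kinase
cu_pmol = metal_picomoles(kinase_pmol)               # 2.5 equivalents
cu_um = micromolar(cu_pmol, 180.0)                   # 180 µl reaction

print(f"kinase: {kinase_pmol:.1f} pmol")
print(f"metal:  {cu_pmol:.1f} pmol ({cu_um:.2f} µM in 180 µl)")
```

Under this assumed mass, 2.5 equivalents works out to a sub-micromolar metal concentration, lower than the fixed 2.5 µM CuSO₄ condition that the text lists separately.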

Copper binding. Recombinant GST-MEK1, GST-MEK2, GST-MEK5 proteins
or mutants thereof (1 µg) were incubated for 2 h in 500 µl of RIPA containing 30 µl
of Cu-pentadentate resin (Affiland) or 30 µl of free pentadentate resin (Affiland) at
4 °C. SDS-PAGE analysis and immunoblot were performed as described above.
Cu pulldown experiments were performed as described previously*.
Metal-catalysed oxidation, mass spectrometry and data analysis. GST-MEK1 
was loaded with Cu(i1) by using a CuSO, -histidine complex as described previously’. 
Metal-catalysed oxidation (MCO) reactions were performed as described previously”*. 
In brief, metal-catalysed oxidation (MCO) reactions were performed at 37 °C in 
50mM HEPES pH 8.0, 150 mM NaCl containing 40 14M GST-MEK1, 100mM 
ascorbate and 1mM H,0,. Untreated MEK1 was included as the control. The 
reactions were stopped by precipitating the proteins in methanol/chloroform 
followed by reduction for 30 min with 2.5 mM dithiothreitol (DTT) at 37 °C and 
alkylation with 20 mM iodoacetamide in the dark at room temperature for 30 min 
(refs 38, 39). In-solution digestion was performed overnight at 37 °C with trypsin 
(1:40 enzyme:protein ratio; modified, sequencing grade; Promega), which hydro- 
lyses peptide bonds at the carboxyl end of lysine and arginine, or chymotrypsin 
(1:60 enzyme:protein ratio; Sigma), which hydrolyses peptide bonds at the carboxyl 
end of aromatic or large hydrophobic side chains of tyrosine, tryptophan, pheny- 
lalanine, methionine and leucine. The digested peptide samples were desalted with 
stage tips*® and freeze-dried with a SpeedVac. Liquid chromatography-tandem 
mass spectrometry (LC-MS/MS) analyses were performed on a Thermo Scientific 
LTQ Orbitrap XL mass spectrometer (Thermo Scientific) with a Finnigan Nanospray II
electrospray ionization source. Digested peptides were injected onto a
75 µm × 150 mm BEH C18 column (particle size 1.7 µm; Waters) and separated
using a Waters nanoACQUITY Ultra Performance LC (UPLC) System (Waters). 
The LTQ Orbitrap XL was operated in data-dependent mode using the TOP10 
strategy as described previously*’. In brief, each scan cycle was initiated with a full 
MS scan of high mass accuracy (375-1,800 m/z; acquired in the Orbitrap XL at
6 × 10⁴ resolution setting and automatic gain control target of 10⁶), which was
followed by MS/MS scans (automatic gain control target 5,000; threshold 3,000) in
the linear ion trap on the ten most abundant precursor ions. Selected ions were 
dynamically excluded for 30 s. Singly charged ions were excluded from MS/MS
analysis. MS/MS spectra were searched by using the SEQUEST algorithm against 
the human MEK1 sequence. Search parameters allowed for two missed tryptic 
cleavages, a mass tolerance of ±80 p.p.m., a static modification of 57.02146 Da
(carboxyamidomethylation) on cysteine, and dynamic modifications of 15.99491 Da
(oxidation) on methionine or histidine. The MS/MS spectra of matched peptides 
were validated manually. Modelling of the distance between the thiol groups of 
methionines or imidazole groups of histidines was performed with the three-dimensional
structure of MEK1 in complex with magnesium and ATP-γS determined
at 2.1 Å (PDB 3EQD)".
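
The search parameters above (cleavage after lysine and arginine, two missed cleavages, an ±80 p.p.m. tolerance, and the two mass shifts) can be illustrated with a minimal Python sketch. It is not the SEQUEST implementation; the function names are hypothetical, and the absence of a proline exception is an assumption, since the text does not state one.

```python
# Monoisotopic mass shifts quoted in the search parameters above (Da).
CARBAMIDOMETHYL_CYS = 57.02146  # static modification on cysteine
OXIDATION = 15.99491            # dynamic modification on methionine or histidine


def tryptic_peptides(sequence, max_missed=2):
    """Cleave after every K or R and allow up to max_missed missed cleavages.

    Assumption: no proline exception is applied; the text does not state one.
    """
    fragments, current = [], ""
    for residue in sequence:
        current += residue
        if residue in "KR":
            fragments.append(current)
            current = ""
    if current:
        fragments.append(current)

    peptides = []
    for start in range(len(fragments)):
        for missed in range(max_missed + 1):
            end = start + missed + 1
            if end > len(fragments):
                break
            peptides.append("".join(fragments[start:end]))
    return peptides


def modification_shift(peptide, n_oxidized=0):
    """Mass added by the static Cys modification plus n_oxidized oxidations."""
    oxidizable = sum(peptide.count(aa) for aa in "MH")
    if not 0 <= n_oxidized <= oxidizable:
        raise ValueError("more oxidations requested than Met/His residues present")
    return peptide.count("C") * CARBAMIDOMETHYL_CYS + n_oxidized * OXIDATION


def within_tolerance(observed_mass, theoretical_mass, tol_ppm=80.0):
    """True when the relative mass error is within +/- tol_ppm parts per million."""
    error_ppm = (observed_mass - theoretical_mass) / theoretical_mass * 1e6
    return abs(error_ppm) <= tol_ppm
```

For a 1,000 Da peptide, the 80 p.p.m. window corresponds to ±0.08 Da, so an observed mass of 1,000.05 Da would be accepted and 1,000.10 Da rejected.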

Circular dichroism spectroscopy. Circular dichroism (CD) data were collected
on a JASCO J-815 CD spectrometer with a JASCO Peltier device and water bath to
control the temperature. Experiments were performed in a 1-mm cuvette at a protein
concentration of 5 µM in 20 mM Tris-HCl pH 8.0 and 100 mM NaCl. Far-ultraviolet
scans were collected from 200 nm to 250 nm. Thermal denaturation of MEK1 and
MEK1^CBM proteins was monitored at 222 nm to estimate the protein melting
temperature. The temperature ramp rate was 1 °C min⁻¹ and data points were
collected every 1 °C. All data are reported in units of mean residue ellipticity, which
was calculated from $[\theta]_{\mathrm{MRE}} = (\theta_{\mathrm{raw}} \times \mathrm{MRW})/(10\,c\,l)$, where $\theta_{\mathrm{raw}}$ is the ellipticity in
degrees, MRW is molecular mass in daltons divided by (number of residues − 1), $c$
is the protein concentration (g ml⁻¹) and $l$ is the path length of the cuvette (cm),
as described previously”’.
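
As a worked illustration of the mean residue ellipticity formula above, the following Python sketch evaluates it for arbitrary inputs. The function and parameter names are hypothetical, and the numbers in the usage note are illustrative values, not measurements from this study.

```python
def mean_residue_weight(molecular_mass_da, n_residues):
    """MRW: molecular mass in daltons divided by (number of residues - 1)."""
    return molecular_mass_da / (n_residues - 1)


def mean_residue_ellipticity(theta_raw, mrw, conc_g_per_ml, path_cm):
    """[theta]_MRE = (theta_raw * MRW) / (10 * c * l), units as defined in the text."""
    return (theta_raw * mrw) / (10.0 * conc_g_per_ml * path_cm)
```

For example, `mean_residue_weight(1100.0, 11)` gives an MRW of 110, and `mean_residue_ellipticity(2.0, 110.0, 0.5, 0.1)` evaluates (2 × 110)/(10 × 0.5 × 0.1).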

Differential scanning fluorimetry. Differential scanning fluorimetry data were
collected on a CFX384 Touch™ Real-Time PCR Detection System (Bio-Rad) at a
protein concentration of 1.3 µM in 25 mM Tris-HCl pH 7.5, 20 mM MgCl2, 2 mM
DTT using SYPRO orange as described previously**. The melting temperature was
calculated by determining the maximum of the first-derivative curve of normalized
data in Prism 5 (GraphPad) as described previously”.
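
The first-derivative approach to the melting temperature can be sketched in Python as follows. This is not the Prism 5 procedure itself: min-max normalization and a central-difference derivative are assumptions made for illustration, and the function name is hypothetical.

```python
def melting_temperature(temps, signal):
    """Return the temperature at which d(normalized signal)/dT is largest.

    Assumptions: min-max normalization and a central-difference derivative;
    the text only says the maximum of the first-derivative curve was used.
    """
    lo, hi = min(signal), max(signal)
    if hi == lo:
        raise ValueError("signal is flat; no transition to locate")
    norm = [(s - lo) / (hi - lo) for s in signal]

    best_tm, best_slope = None, float("-inf")
    for i in range(1, len(temps) - 1):
        slope = (norm[i + 1] - norm[i - 1]) / (temps[i + 1] - temps[i - 1])
        if slope > best_slope:
            best_tm, best_slope = temps[i], slope
    return best_tm
```

With readings of 0, 1, 5, 9 and 10 at 40, 45, 50, 55 and 60 °C, the steepest rise is centred on 50 °C, which is the value the function returns.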

Mouse xenografts, drug treatments and diet alteration. MEFs (5X 10°) or 
melanoma cells (10”) resuspended in PBS were injected subcutaneously into flanks 
of 8-12-week-old female SCID/beige mice (Charles River Laboratory) as described 
previously*”. Unblinded drug treatments were as follows: vehicle (1% methylcellulose
(Sigma), 1% dimethylsulphoxide (DMSO; Sigma)), 2.0 mg tetrathiomolybdate
(TTM; Sigma) in vehicle, or 20 mg kg⁻¹ vemurafenib (Chemietek) in vehicle
every other day by oral gavage. Mice were fed with a normal diet (PMI 5053 Picolab
Mouse Diet 20; LabDiet) or, where indicated, a Cu-deficient diet (CuD diet, TD.80388;
Harlan Teklad). Mice fed with a CuD diet were administered deionized H2O (diH2O)
supplemented with 20 mg l⁻¹ CuSO4, diH2O alone, or diH2O supplemented with
3 g l⁻¹ trientine dihydrochloride (Sigma). All studies were approved by the Duke
University Institutional Animal Care and Use Committee. Statistical analysis of 
tumour volumes at endpoint was performed with a one-tailed, unpaired t-test, 
with a 95% confidence interval for two group data sets or one-way analysis of 
variance (ANOVA) with a 95%, 99% or 99.9% confidence interval and Tukey’s 
multiple comparison post test for three or more data sets in Prism 5 (GraphPad). 
The percentage of mice with tumours of at least 1.0 cm³ versus
time was analysed with a survival curve log-rank (Mantel-Cox) test in Prism 5
(GraphPad).

Mouse lung cancer model. Ctr1^flox/flox mice” were interbred with Braf^CA/+;Trp53^flox/flox
(BP) mice”’, a gift from D. G. Kirsch, for three generations to generate
Braf^CA/+;Trp53^flox/flox;Ctr1^+/+ and Braf^CA/+;Trp53^flox/flox;Ctr1^flox/flox mice. PCR was
performed to detect wild-type, conditional and recombined alleles of Braf, Ctr1 and
Trp53, as described previously”***. Cohorts of these animals of known genotype
were administered 6 X 10° plaque-forming units of adenoviral Cre (University of 
Iowa) intranasally between 69 and 85 days of age. Mice were then monitored and 
killed three months later or at moribundity endpoints. All studies were
approved by the Duke University Institutional Animal Care and Use Committee. 
Statistical analysis of percentage survival versus time was conducted with a sur- 
vival curve log-rank (Mantel-Cox) test in Prism 5 (GraphPad). 

Analysis of lung tumours. To quantify the number of visible surface tumours per
mouse, lungs were resected from five Ctr1^+/+ and five Ctr1^flox/flox BP mice of known
genotype three months after treatment with AdCre and the numbers of tumours
visible on the surface of lungs were counted. Statistical analysis of the average
number of tumours per mouse was performed using a one-tailed, unpaired t-test,
with a 95% confidence interval for two group data sets. To quantify the percentage
of abnormal lung tissue, lungs were resected from five Ctr1^+/+ and five Ctr1^flox/flox
BP mice three months after treatment with AdCre, fixed in 10% formalin and
embedded in paraffin. Sections were deparaffinized, rehydrated and subjected to
epitope retrieval before staining with haematoxylin (Surgipath) and eosin (Fisher
Scientific). Photographs were taken at low power on an Olympus Vanox S microscope
to encompass the entire lung; images were blinded, and the abnormal areas
of adenoma and adenocarcinoma from each micrograph were circumscribed using
Adobe Photoshop and the abnormal area in pixels was expressed as a percentage of
the total lung area from all micrographs. Statistical analysis of the average percentage
of abnormal tissue per mouse was performed with a one-tailed, unpaired t-test,
with a 95% confidence interval for two group data sets.

Immunohistochemistry. Three xenograft tumours from mice injected with BRAF^V600E-transformed
Ctr1^+/+ MEFs or lungs resected at moribundity endpoint from
Ctr1^+/+ and Ctr1^flox/flox BP mice treated with AdCre were fixed in 10% formalin
and embedded in paraffin. Sections were deparaffinized, rehydrated and subjected
to epitope retrieval and stained with an anti-P-ERK1/2 (Thr 202/Tyr 204) antibody
(Cell Signaling), followed by peroxidase-based detection with Vectastain
Elite ABC Kits (Vector Labs) and counterstaining with haematoxylin. Photographs
were taken of high-power fields of highest positivity (P-ERK IHC) on
an Olympus Vanox S microscope. Images were blinded and tumours were circumscribed
in P-ERK1/2-stained tissue images with the freehand selection tool in
ImageJ, and the total area of the tumour in pixels was recorded. Tumour images
were copied and pasted to new, blank images and colour thresholding was applied
to determine positive-staining areas, using the same parameters for each tumour
image. Areas staining positive by these parameters were selected and the positive-staining
area in pixels was recorded. The percentage positive-staining area was
calculated by dividing the positive-staining area of the tumour in pixels by the total
area of the tumour in pixels. Statistical significance was determined with unpaired,
one-tailed t-tests between treated and untreated, or between Ctr1^+/+ and Ctr1^flox/flox
cohorts, using Prism 5 (GraphPad).
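
The percentage positive-staining area described above reduces to a ratio of pixel counts. The Python sketch below illustrates that arithmetic only; it does not reproduce ImageJ's colour thresholding. The function name, the nested-list image representation and the caller-supplied `is_positive` rule are all assumptions.

```python
def percent_positive_area(pixels, tumour_mask, is_positive):
    """Percentage of circumscribed tumour pixels that pass a fixed staining rule.

    pixels       -- 2D list of pixel values (any representation)
    tumour_mask  -- 2D list of booleans marking the circumscribed tumour
    is_positive  -- function applied identically to every image (hypothetical rule)
    """
    total = positive = 0
    for pixel_row, mask_row in zip(pixels, tumour_mask):
        for value, inside in zip(pixel_row, mask_row):
            if inside:
                total += 1
                if is_positive(value):
                    positive += 1
    if total == 0:
        raise ValueError("tumour mask selects no pixels")
    return 100.0 * positive / total
```

Passing the same `is_positive` rule for every image mirrors the requirement that identical thresholding parameters be applied to each tumour image.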

Cell growth and soft agar assays. For growth-curve experiments, cells were plated 
at a density of 5,000 cells per well in six replicate wells in four 24-well plates. At 
days 0, 1, 2 and 3, one plate was fixed for 5 min in formalin, washed with PBS and 
stained for 30 min with 0.1% crystal violet (Sigma) and left to dry for at least 24 h.
Stain was extracted in 200 µl of 10% acetic acid in each well, transferred to a 96-well
plate, and the absorbance at 600 nm was measured with a GloMax Multi Detection 
System plate reader (Promega) in an unblinded fashion. Relative growth was deter- 
mined by normalization to the signal at day 0 and plotted in Prism 5 (GraphPad). 
Anchorage-independent growth was assayed in six-well plates with 1 ml of 0.6% 
bactoagar media solution (final concentration 1× DMEM, 10% FBS, 1× penicillin/
streptomycin) as a bottom support layer; 5 X 10° cells per well were resuspended in 
DMEM (10% FBS, 1× penicillin/streptomycin) and mixed 1:1 with 0.6% bactoagar
media solution with an appropriate concentration of DMSO or TTM to give a final
bactoagar concentration of 0.3%, 100 nM TTM, 400 nM TTM, or an equivalent
amount of DMSO and plated in triplicate. Each well was fed on days 3, 7, 14 and 21
with 300 µl of DMEM with the appropriate concentration of DMSO or TTM.
Colonies were counted unblinded between days 21 and 28. Statistical analysis of 
soft agar growth was performed with a two-way ANOVA with a 99.9% confidence 
interval and Tukey’s multiple comparison post test in Prism 5 (GraphPad). 
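
The growth-curve normalization described in this section (relative growth as signal divided by the day 0 signal) can be sketched in Python as below. Averaging the replicate wells before normalizing is an assumption, the function name is hypothetical, and the absorbance values in the usage note are illustrative rather than data from this study.

```python
from statistics import mean


def relative_growth(absorbance_by_day):
    """Normalize mean absorbance on each day to the mean absorbance at day 0.

    absorbance_by_day maps day number -> list of replicate-well readings.
    Assumption: replicate wells are averaged before normalization.
    """
    baseline = mean(absorbance_by_day[0])
    if baseline == 0:
        raise ValueError("day 0 signal is zero; cannot normalize")
    return {
        day: mean(wells) / baseline
        for day, wells in sorted(absorbance_by_day.items())
    }
```

For instance, replicate readings of 0.1 at day 0 and 0.4 at day 2 give a relative growth of 4 at day 2.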
Extended Data Figure 1 | CuSO4 stimulates MEK1/2 kinase activity in vitro.
a, c, Detection of the amount of in vitro phosphorylated (P) recombinant
GST-tagged kinase-inactive ERK2 protein by recombinant GST-tagged
MEK1 in the presence, when indicated, of 2.5 molar equivalents of CuSO4 (Cu),
AgNO3 (Ag), FeNH4SO4 (Fe), NiSO4 (Ni) or ZnSO4 (Zn) (a) or recombinant
GST-tagged MEK2 in the presence, when indicated, of 2.5 µM CuSO4 and/or
50 µM TTM (c). Total (T) levels of ERK2, MEK1 and MEK2 serve as loading
controls. b, Immunoblot detection of the amount of recombinant GST-tagged
MEK2 protein bound to a resin charged with (Cu) or without (−) Cu.
Input serves as a loading control. Gel images are representative of two
technical replicates.
Extended Data Figure 2 | Genetic ablation of Ctr1 decreases BRAF^V600E-mediated
cell growth and tumorigenesis. a, Cell growth, as measured by
staining with crystal violet, of BRAF^V600E-transformed Ctr1^+/+ (black circles)
or Ctr1^−/− (red squares) MEFs (plated in sextuplicate) over a period of three
days. Representative of three independent experiments using the same cells.
b, c, Representative resected tumours (scale bar, 1 cm) at 20 days after injection
(b) and Kaplan-Meier analysis of percentage of mice with tumour volume
at least 1.0 cm³ versus time (c) of mice (n = 8) injected with BRAF^V600E-transformed
Ctr1^+/+ (black line) or Ctr1^−/− (red line) MEFs. Results were
compared using a one-tailed unpaired t-test (a) or a Mantel-Cox test (c).
Four asterisks, P < 0.0001.
Extended Data Figure 3 | Identification of Cu-binding mutants of MEK1
that decrease ERK1/2 phosphorylation. a, Immunoblot detection of the
amount of HA-tagged wild-type MEK1 and an example of one MEK1 mutant
tested (H188A) that bound to a Cu-charged resin. Input serves as a loading
control. b, Immunoblot detection of the amount of phosphorylated (P) and/or
total (T) ERK1/2 or HA-MEK1 protein in Ctr1^+/+ MEFs stably expressing
HA-tagged wild-type MEK1 or an example of one MEK1 mutant tested
(H188A). c, Summary of whether the indicated MEK1 point mutants did (YES)
or did not (NO) show a decrease in binding to the Cu-charged resin or show a
decrease in the levels of phosphorylated (P) ERK1/2 when stably expressed
in Ctr1^+/+ MEFs. Gel images are representative of two technical replicates.
Extended Data Figure 4 | Amino acids in MEK1 identified to be oxidized by
the MCO reaction followed by MS/MS. Representative annotated MS/MS
fragmentation spectra for five indicated MEK1-derived peptides containing
oxidized residues highlighted in red: a, MEK1"*’ and MEK1™™*; b, MEK1'"™;
c, MEK1""*®; d, MEK1™?°; e, MEK1!"°. The peak heights are the relative
abundances of the corresponding fragmentation ions, with the annotation of
the identified matched amino-terminus-containing ions (b ions) in blue and
the carboxy-terminus-containing ions (y ions) in red. For clarity, only the
major identified peaks are labelled. f, Amino-acid sequence of human MEK1
with the peptides identified by MS/MS underlined (red, trypsin digest; blue,
chymotrypsin digest). Amino acids oxidized only in the presence of H2O2 in
one to three independent MCO reactions are denoted in red. Boxes enclose
amino acids that when mutated to alanine decreased both the binding of MEK1
to a Cu-charged resin and the phosphorylation of cellular ERK2 (from
Extended Data Fig. 3c).
MEK5 KVNTRAGPSQHSSPAVSDSLPSNSLKKSSAELKKILANGQMNEQDIRYRDTLGHGNGGTV 180
MEK1-MEK5 ELDEQQR@------9--9-------- KRLEAFLTQKQKVGELKDDDIRYRDTLGHGNGGTV 82 
MEK1 FKVSHKPSGLVMARKLIHLEIKPAIRNQIIRELQVLHECNSPYIVGFYGAFYSDGEISIC 142 
MEK2 TKVQHRPSGLIMARKLIHLEIKPAIRNQIIRELQVLHECNSPYIVGFYGAFYSDGEISIC 146 
MEK5 YKAYHVPSGKILAVKVILLDITLELQKOIMSELEILYKCDSSYIIGFYGAFFVENRISIC 240 
MEK1-MEK5 YKAYHVPSGKILAVKVILLDITLELQKOIMSELEILYKCDSSYIIGFYGAFFVENRISIC 142 
MEK1 MEHMDGGSLDQVLKKAGRIPEQILGKVSIAVIKGLTYLREKHKIMHRDVKPSNILVNSRG 202 
MEK2 MEHMDGGSLDQVLKEAKRIPEEILGKVSIAVLRGLAYLREKHQIMHRDVKPSNILVNSRG 206 
MEK5 TEFMDGGSLDVYR----KMPEHVLGRIAVAVVKGLTYLWS-LKILHRDVKPSNMLVNTRG 295 
MEK1-MEK5 TEFMDGGSLDVYR----KMPEHVLGRIAVAVVKGLTYLWS-LKILHRDVKPSNMLVNTRG 197 
MEK1 EIKLCDFGVSGQLIDSMANSFVGTRSYMSPERLOGTHYSVOSDIWSMGLSLVEMAVGRYP 262 
MEK2 EIKLCDFGVSGQLIDSMANSFVGTRSYMAPERLOGTHYSVOSDIWSMGLSLVELAVGRYP 266 
MEK5 QVKLCDFGVSTOLVNSIAKTYVGTNAYMAPERISGEQYGIHSDVWSLGISFMELALGRFP 355 
MEK1-MEK5 QVKLCDFGVSTOLVNSIAKTYVGTNAYMAPERISGEQYGIHSDVWSLGISFMELALGRFP 257 
MEK1 IPPPDAKELELMFG-CQVEGDAAETP---PRPRTPGRPLSSYGMDSRPPMAIFELLDYIV 318 
MEK2 IPPPDAKELEAIFGRPVVDGEEGEPHSISPRPRPPGRPVSGHGMDSRPAMAIFELLDYIV 326 
MEK5 SLMPLQLLOQCIV 376
MEK1-MEK5 SLMPLQLLOCIV 278 
MEK1 NEPPPKLPSGVFSLEFQDFVNKCLIKNPAERADLKQLMVHAFIKRS-DAEEVDFAGWLCS 377 
MEK2 NEPPPKLPNGVFTPDFQEFVNKCLIKNPAERADLKMLTNHTFIKRS-EVEEVDFAGWLCK 385 
MEK5 DEDSPVLPVGEFSEPFVHF ITOCMRKQPKERPAPEELMGHPFIVOQFNDGNAAVVSMWVCR 436 
MEK1-MEK5 DEDSPVLPVGEFSEPFVHF ITQCMRKQPKERPAPEELMGHPFIVOFNDGNAAVVSMWVCR 338 
MEK1 TIG--LNQPSTPTHAAGV 393 

MEK2 TLR--LNQPGTPTRTAV- 400 

MEK5 ALEERRSQOQGPP 

MEK1-MEK5 ALEERRSQQGPP 


Extended Data Figure 5 | Alignment of the amino-acid sequences of MEK1,
MEK2, MEK5 and MEK1-MEK5. The amino-acid sequences of human
MEK1, MEK2, MEK5 and the MEK1-MEK5 chimaeric protein (without the
DD mutation) aligned using Clustal W. Black letters, amino acids; coloured
letters, the four amino acids mutated in MEK1^CBM to decrease Cu binding
(blue, conserved between MEK1, MEK2 and MEK5; red, conserved only
between MEK1 and MEK2). Dashes indicate gaps in the alignment.
Extended Data Figure 6 | Protein purification and biochemical analysis of
wild-type and CBM versions of MEK1. a, Coomassie blue detection of the
amount of wild-type or CBM mutant purified recombinant GST-tagged MEK1
protein in the absence or presence of precision protease for cleavage of GST.
b-d, Circular dichroism spectra at increasing wavelengths (b), thermal
denaturation monitored at 222 nm at increasing temperature (c), and
differential scanning fluorimetry at increasing temperature (d, left) and the
average estimated melting temperature (d, right) of purified recombinant
MEK1^WT (black circles and line) and MEK1^CBM (red squares and line). Data
are representative of two technical replicates.
Extended Data Figure 7 | Tumorigenic growth of NRAS mutation-positive
human melanoma cancer cell lines on knockdown of CTR1. a, b, RT-PCR
detection of the amount of endogenous CTR1 and GAPDH mRNA (a) and
mean tumour volume (±s.e.m.) versus time (b) of mice (n = 3) injected with
the NRAS mutation-positive (NRAS?*"") human melanoma cell lines DM598
and DM792 stably infected with a retrovirus expressing either a scramble
(SCRAM) shRNA (black circles) or CTR1 shRNA (red squares). Results were
compared using a one-tailed unpaired t-test (b). Four asterisks, P < 0.0001.
Extended Data Figure 8 | Detection of Cre-mediated recombination and
weight measurements of AdCre-treated Ctr1^+/+ versus Ctr1^flox/flox BP mice.
a, PCR detection of Braf^CA/+, Trp53^flox/flox and Ctr1^flox/flox recombined alleles
from matched tail samples (T) and lung tumour cell lines (C) generated
WT; red, flox; blue, null; orange, Braf~; green, Braf’°°*. b, Box-and-whiskers
plot of weight of Ctr1^+/+ versus Ctr1^flox/flox BP mice (n = 30) one month after
intranasal treatment with AdCre. Results were compared using a one-tailed
unpaired t-test (b). Four asterisks, P < 0.0001.
Extended Data Figure 9 | TTM does not decrease the weight of mice with
tumours. Mean weight (±s.e.m.) over time of mice (n = 4) injected with
BRAF^V600E-transformed MEFs and treated with vehicle (black circles) or TTM
(red squares).
BRAF^V600E-mediated signalling and tumorigenesis. Inactivation of the in red.
Protective mucosal immunity mediated by epithelial CD1d and IL-10


Torsten Olszak'*, Joana F. Neves!*, C. Marie Dowds”*, Kristi Baker’, Jonathan Glickman’*, Nicholas O. Davidson’, 
Chyuan-Sheng Lin®, Christian Jobin®, Stephan Brand’, Karl Sotlar®, Koichiro Wada’, Kazufumi Katayama’, Atsushi Nakajima’®, 
Hiroyuki Mizuguchi", Kunito Kawasaki!?, Kazuhiro Nagata’, Werner Müller!?, Scott B. Snapper, Stefan Schreiber’,


Arthur Kaser!, Sebastian Zeissig'?* & Richard S. Blumberg’* 


The mechanisms by which mucosal homeostasis is maintained are 
of central importance to inflammatory bowel disease. Critical to 
these processes is the intestinal epithelial cell (IEC), which regulates 
immune responses at the interface between the commensal micro- 
biota and the host’”. CD1d presents self and microbial lipid antigens 
to natural killer T (NKT) cells, which are involved in the pathogen- 
esis of colitis in animal models and human inflammatory bowel 
disease* *. As CD1d crosslinking on model IECs results in the pro- 
duction of the important regulatory cytokine interleukin (IL)-10 
(ref. 9), decreased epithelial CD1d expression—as observed in inflam- 
matory bowel disease’*'!— may contribute substantially to intestinal 
inflammation. Here we show in mice that whereas bone-marrow- 
derived CD1d signals contribute to NKT-cell-mediated intestinal 
inflammation, engagement of epithelial CD1d elicits protective effects
through the activation of STAT3 and STAT3-dependent transcription
of IL-10, heat shock protein 110 (HSP110; also known as HSP105),
and CD1d itself. All of these epithelial elements are critically involved
in controlling CD1d-mediated intestinal inflammation. This is demonstrated
by severe NKT-cell-mediated colitis upon IEC-specific deletion
of IL-10, CD1d, and its critical regulator microsomal triglyceride
transfer protein (MTP)'”'%, as well as deletion of HSP110 in the 
radioresistant compartment. Our studies thus uncover a novel path- 
way of IEC-dependent regulation of mucosal homeostasis and high- 
light a critical role of IL-10 in the intestinal epithelium, with broad 
implications for diseases such as inflammatory bowel disease. 

To examine the function of CD1d in the intestinal epithelium, we
generated mice with tamoxifen-inducible IEC-specific deletion of MTP,
a lipid transfer protein encoded by Mttp and required for CD1d function!
(Villin-CreER^T2 (ref. 14) × Mttp^fl/fl (ref. 15); hereafter referred
to as Mttp^ΔIEC). Mttp^ΔIEC mice exhibited no evidence of epithelial lipid
accumulation (Extended Data Fig. 2a) or endoplasmic reticulum (ER)
stress (Extended Data Fig. 2b), but showed decreased IEC-mediated,
CD1d- but not major histocompatibility complex (MHC)-class-I-dependent
antigen presentation despite unimpaired CD1d expression (Extended
Data Fig. 3a, b, c left). Mttp^ΔIEC mice exhibited increased sensitivity to
oxazolone-induced colitis, a CD1d- and NKT-cell-dependent colitis model’,
as shown by increased mortality (Fig. 1a, left), weight loss (Fig. 1a, right)
and histological injury (Fig. 1b and Extended Data Fig. 3d), in comparison
with wild-type littermates (Villin-CreER^T2-negative Mttp^fl/fl
mice). Increased severity of colitis was not caused by primary epithelial
barrier defects (Extended Data Fig. 4a) but was due to loss of function
of intestinal epithelial CD1d. Thus, bone marrow chimaeras with selective
deficiency of CD1d in the radioresistant compartment exhibited
severe mortality and morbidity in the oxazolone model (Fig. 1c, d),
increased production of pathogenic IL-13 and IL-1β in colon explants
(Fig. 1e) and purified CD11b+ lamina propria cells (Extended Data Fig. 4b),
and reduced secretion of protective IL-10 (Fig. 1e). In accordance with
a protective role of intestinal epithelial CD1d, its expression was upregulated
in response to oxazolone colitis in wild-type, but not Mttp^ΔIEC,
mice (Extended Data Fig. 3c, right). Intriguingly, pathogenic signals were
similarly CD1d-dependent but emanated from non-epithelial cells in a
manner dependent on adaptive immune cells. Thus, Mttp^ΔIEC mice on
a Rag1^−/− or Cd1d1^−/−Cd1d2^−/− (CD1d-knockout) background were
protected from oxazolone colitis (Extended Data Fig. 4c, d). Consistent
with a role of non-epithelial cells in the delivery of pathogenic CD1d
signals, antibody-mediated neutralization of (non-epithelial) CD1d in
Mttp^ΔIEC mice reduced the severity of oxazolone-induced colitis (Fig. 1f, g).
Thus, intestinal epithelial CD1d- and MTP-dependent lipid antigen presentation
protects from oxazolone colitis, whereas CD1d- and NKT-cell-mediated
inflammation emanates from non-epithelial cells, presumably
bone-marrow-derived lamina propria cells.

To investigate directly the role of epithelial CD1d, we developed mice
with inducible, IEC-specific deletion of CD1d (Villin-CreER^T2 × Cd1d1^fl/fl,
hereafter referred to as Cd1d1^ΔIEC) (Extended Data Fig. 5a). Tamoxifen-treated
Cd1d1^ΔIEC mice exhibited normal numbers of invariant (i) NKT
cells and lacked intestinal epithelial expression of CD1d and CD1d- but
not MHC-class-I-mediated antigen presentation by IECs (Extended
Data Fig. 5b-e). Oxazolone-challenged Cd1d1^ΔIEC mice, like Mttp^ΔIEC
mice, exhibited increased mortality (Fig. 1h, left), weight loss (Fig. 1h,
right), histopathological injury (Fig. 1i), colon shortening (Extended
Data Fig. 5f) and elevated secretion of IL-13 and IL-1β, as well as impaired
secretion of IL-10 from colon explant cultures (Fig. 1j). Moreover, colonic
iNKT cells from Cd1d1^ΔIEC mice showed increased IL-13 and IFN-γ
expression in oxazolone colitis (Fig. 1k). Thus, intestinal epithelial CD1d
protects from intestinal inflammation.

To gain insight into the mechanisms underlying protection by intestinal
epithelial CD1d, we performed transcriptional profiling of intestinal
mucosal scrapings from wild-type and Mttp^ΔIEC mice in the oxazolone
model (Extended Data Table 1 and data not shown). HSP110, encoded
by Hsph1, exhibited decreased expression in Mttp^ΔIEC mice (Fig. 2a),
which was of interest given its known role in induction of epithelial CD1d
expression”®. In line with a potential MTP- and CD1d-dependent role of
HSP110 in intestinal inflammation, HSP110 messenger RNA (Fig. 2b,
left) and protein expression (Extended Data Fig. 6a) were increased in
IECs from wild-type but not Mttp^ΔIEC mice upon oxazolone challenge.
Similar observations were made for Cd1d1^ΔIEC mice (Fig. 2b, right). In
addition, intestinal epithelial but not lamina propria HSP110 expression
was reduced in active and inactive human ulcerative colitis compared
with healthy controls (Fig. 2c and Extended Data Fig. 6b), which
is in accordance with impaired epithelial CD1d expression in inflammatory
bowel disease, as previously reported’. Furthermore, consistent
with regulation of CD1d expression by HSP110 (ref. 16), oxazolone
colitis was associated with induction of CD1d expression in wild-type
but not Mttp^ΔIEC IECs (Extended Data Fig. 3c, right). Together, these
data indicate that IEC CD1d and MTP are required for epithelial HSP110
upregulation during oxazolone-induced colitis and that alterations in
this pathway are observed in human ulcerative colitis.

To understand better the functional consequences of epithelial HSP110
deficiency observed in Mttp^ΔIEC and Cd1d1^ΔIEC mice, bone marrow
chimaeras of HSP110-knockout (HSP110-KO) or wild-type mice receiving
wild-type bone marrow were studied. Compared with wild-type
recipients, HSP110-knockout recipients of wild-type bone marrow
exhibited increased mortality and weight loss (Fig. 2d), microscopic
injury (Fig. 2e) and increased IL-13 and IL-1β secretion (Fig. 2f) in the
oxazolone model. Morbidity and mortality in HSP110-knockout recipients
of wild-type bone marrow reflected that observed for conventional
HSP110-knockout mice, suggesting critical, protective effects of
HSP110 in the radioresistant compartment, including IECs (Extended
Data Fig. 6c). Antibody-mediated neutralization of CD1d reduced the
severity of colitis in HSP110-knockout bone marrow chimaeras, in
line with pathogenic CD1d-dependent signals originating from non-epithelial
lamina propria cells (Fig. 2d, f). Together, these studies indicate

Figure 2 | HSP110 elicits protective effects downstream of intestinal
epithelial CD1d. a, b, Hsph1 expression (qPCR) in purified IECs of the
indicated mouse strains at baseline (a) or on day 5 after rectal oxazolone (Ox.)
or ethanol (EtOH) (b). c, HSP110 immunohistochemistry in colon of human
ulcerative colitis and controls. d-i, Survival (d, bottom, h, left), body weight
(d, top, h, right), histopathology (e, i), and cytokine secretion in colon
explants (ELISA) (f) in the oxazolone model. d, f, Wild-type (WT) and
HSP110-KO recipients of wild-type bone marrow received 19G11 anti-CD1d
or isotype control antibody. g-i, Mice received adenoviruses expressing
HSP110 (AdHSP110) or β-galactosidase (AdLacZ). g, Immunofluorescence of
colonic GFP after AdHSP110-IRES2-eGFP administration. DAPI,
4′,6-diamidino-2-phenylindole. j-l, MODE-K Hsph1 and Il10 (qPCR) after
antibody-mediated CD1d crosslinking (1B1) (j, k) or α-GalCer presentation to
iNKT cells (24.7 hybridoma) in the presence/absence of neutralizing CD1d
antibodies (19G11) (l). mAb, monoclonal antibody. m, Hsph1 and Il10 (qPCR)
in purified small intestinal IECs from wild-type and TD-CD1d recipients of
wild-type bone marrow after injection with vehicle or α-GalCer (6 mice per
group). c, g, Scale bars, 50 µm. Results representative of two or three
independent experiments are shown. Means ± s.e.m. of the indicated number
of mice or triplicate cultures (l) are indicated. Student’s t-test or log-rank
test (survival) were applied.

that reduced HSP110 expression in the absence of functional epithelial
CD1d may contribute to severe oxazolone colitis. Consistent with this
hypothesis, reconstitution of HSP110 expression by adenoviruses targeting
IECs (Fig. 2g) ameliorated oxazolone-induced colitis in Mttp^ΔIEC mice,
as shown by decreased mortality (Fig. 2h, left), weight loss (Fig. 2h, right)
and microscopic injury (Fig. 2i). It is noteworthy that adenoviruses also

Figure 3 | Epithelial CD1d induces STAT3-dependent expression of IL-10
and HSP110. a, MODE-K pSTAT3 after CD1d-crosslinking (1B1 monoclonal
antibody clone) or iNKT (24.7 hybridoma) co-culture. b, Colonic pSTAT3
immunohistochemistry 5 days after rectal oxazolone (Ox.) or ethanol (EtOH).
Scale bar, 50 µm. c, pSTAT3 in purified small intestinal IECs from the
indicated bone marrow chimaeras (IECs from one mouse per lane). d-f, Cd1d1,
Hsph1 and Il10 (qPCR) in MODE-K cells after treatment with recombinant
HSP110 (e) or Stat3/control siRNA knockdown (d, f) followed by viral
transduction at a multiplicity of infection (MOI) of 5 (f). g, Il10, Hsph1 and
Tgfb in purified IECs (qPCR) 6 h after rectal oxazolone in the presence
or absence of viral infection. The fold change
compared to oxazolone-treated Cd1d1*'* mice

targeted lamina propria cells (Fig. 2g), which may have contributed to
protection from intestinal inflammation. 

As mice with an IEC-specific deficiency in MTP and CD1d exhibited
impaired epithelial expression of HSP110, we tested whether engagement
of epithelial CD1d supports HSP110 expression. To this end, we
investigated HSP110 in purified (Extended Data Fig. 7) colonic IECs
after antibody-mediated CD1d crosslinking, a treatment previously shown
to induce production of barrier-protective IEC-derived IL-10 (ref. 9).
CD1d crosslinking led to increased expression of Il10 (Fig. 2j) and Hsph1
(Fig. 2k) in IECs from wild-type but not Mttp^ΔIEC mice. Similar observations
were made upon in vitro co-culture of NKT cells with a mouse IEC line
(Fig. 2l). Previous studies in a cultured human IEC line suggested that
epithelial IL-10 production upon CD1d crosslinking is elicited through
retrograde signalling dependent on the cytoplasmic tail of CD1d’. We
therefore investigated whether bone marrow chimaeras with selective
deletion of the cytoplasmic CD1d tail (hereafter referred to as tail-deleted
(TD)-CD1d) in radioresistant cells exhibit impaired expression of Il10
and Hsph1 in purified IECs. Indeed, IECs obtained from these mice
showed impaired expression of Il10 and Hsph1 under constitutive conditions
and reduced expression after administration of the CD1d-binding,
iNKT-cell-activating lipid α-galactosylceramide (α-GalCer) (Fig. 2m).
These data demonstrate that engagement of epithelial CD1d induces
expression of HSP110 and IL-10, presumably through retrograde CD1d
signalling. Additional effects of the CD1d cytoplasmic tail, such as
regulation of intracellular CD1d trafficking” and potential effects on
NKT cell homing’, may further contribute to protection from intestinal
inflammation.

As the Il10, Hsph1 and Cd1d1 genes contain STAT sites in their
promoters (ref. 18 and T.O. et al., unpublished observations), and as
Hsph1 and Cd1d1 exhibit reduced expression in Stat3-deficient IECs
(Gene Expression Omnibus accession number GSE15955; ref. 19), we
reasoned that STAT3 may contribute to Il10 and Hsph1 expression
downstream of CD1d. Indeed, epithelial STAT3 became phosphorylated in
response to CD1d engagement by antibody crosslinking (Fig. 3a, left) 
and iNKT-cell co-culture (Fig. 3a, right, and Extended Data Fig. 8a). 
Moreover, impaired epithelial STAT3 phosphorylation was observed 
in Cd1d1ΔIEC mice (Fig. 3b) and TD-CD1d recipients of wild-type bone
marrow (Fig. 3c). Furthermore, short interfering RNA (siRNA)-mediated
knockdown of Stat3 in the IEC line MODE-K (Extended Data Fig. 8b)
led to reduced expression of Cd1d1, Hsph1 and Il10 (Fig. 3d). Together,
these data demonstrate that STAT3 acts downstream of epithelial CD1d
to induce expression of Il10, Hsph1 and Cd1d1.

IL-10 and HSP110 not only exhibit STAT3-dependent expression
but also signal via STAT3 (refs 18, 20) and may thus further contribute
to sustained activation of epithelial STAT3 and STAT3-dependent
expression of Cd1d1, Il10 and Hsph1. Consistent with this concept, both
recombinant and adenoviral HSP110 induced epithelial expression of
Il10 and Cd1d1 in MODE-K cells, which was abrogated upon siRNA-
mediated knockdown of Stat3 (Fig. 3e, f). Similar findings were made
in vivo, as adenoviruses expressing HSP110, but not LacZ, restored
epithelial expression of IL-10 in Cd1d1ΔIEC mice in the oxazolone model
(Fig. 3g, h). These data demonstrate that HSP110 acts in a STAT3- 
dependent manner to support the expression of IL-10 and CD1d by IECs. 

Given HSP110-dependent regulation of epithelial IL-10 expression, 
we investigated whether the protective effects of HSP110 are dependent 
on IL-10. CD11b+ lamina propria cells from Cd1d1ΔIEC mice showed
decreased phosphorylated (p)STAT3 expression in oxazolone colitis, 
consistent with decreased IL-10 production by IECs and IL-10 receptor 
(IL-10R) signalling by lamina propria cells (Fig. 3i). Moreover, antibody- 
mediated IL-10R blocking prevented adenoviral HSP110-dependent 
amelioration of colitis (Fig. 3j–l). To investigate further the role of
epithelial IL-10, we generated Il10−/− bone marrow chimaeras. Il10
deletion in the radioresistant compartment resulted in increased mortality
(Fig. 4a) and colonic IL-13 production (Fig. 4b) in the oxazolone model.
Moreover, a similar trend was observed in mice with Il10 deletion in
the radiosensitive compartment (Fig. 4a, b). These data thus reveal that, 
in addition to the established role of bone-marrow-derived cells in
IL-10-mediated control of intestinal inflammation (refs 21, 22), IL-10 derived from
radioresistant cells provides an additional, non-redundant pathway re- 
quired for regulation of intestinal inflammation. To address whether 
protective IL-10 originates from the intestinal epithelium, we
investigated oxazolone colitis in mice with inducible, epithelial-specific
deletion of Il10 (Villin-CreERT2 × Il10fl/fl, hereafter referred to as Il10ΔIEC).
Similar to mice with IEC-specific deletion of Mttp and Cd1d1 and mice
with deletion of Hsph1 and Il10 in the radioresistant compartment,
Il10ΔIEC mice exhibited increased mortality (Fig. 4c), weight loss (Fig. 4d)
and histopathological inflammation (Fig. 4e) in the oxazolone model.
Furthermore, in accordance with STAT3-mediated signalling by IL-10
(ref. 18) and STAT3-dependent expression of Hsph1 and Cd1d1 (Fig. 3d),
IEC-specific Il10 deletion was associated with reduced epithelial expression
of Hsph1 and a trend towards a reduction in Cd1d1 expression in the
oxazolone model (Fig. 4f). These results demonstrate that IEC-derived
IL-10, downstream of epithelial CD1d, HSP110 and STAT3, is critical
for regulation of intestinal inflammation.

NKT cells are central mediators of intestinal inflammation**. In 
studies aiming to identify the cellular origin of pathogenic lipid antigen 
presentation in oxazolone colitis, we made the unanticipated obser- 
vation that intestinal epithelial CD1d protects from intestinal inflam- 
mation in a manner dependent on STAT3, IL-10 and HSP110, whereas 
pathogenic NKT-cell activation emanates from bone-marrow-derived 
cells in a CD1d-restricted fashion (Extended Data Fig. 1a). As further 
outlined in the Supplementary Discussion and Extended Data Fig. 9, 
such opposing roles of CD1d-restricted presentation by bone-marrow- 
derived and epithelial cells are at least partially dependent on the expres- 
sion of co-stimulatory molecules such as CD40. Our findings highlight 
the role of the intestinal epithelium as a key regulator of mucosal immun- 
ity and underscore the important role of intestinal epithelial CD1d and 
IL-10 in determining the host’s response to environmental stimuli, 
which activate CD1d-restricted NKT cells’. 


METHODS SUMMARY 


Mice were housed in a specific pathogen-free (SPF) barrier facility. HSP110-KO
(ref. 23), Cd1d1−/−Cd1d2−/− (ref. 24), TD-CD1d (ref. 17), Il10−/− (ref. 25),
Villin-CreERT2 (ref. 14), Mttpfl/fl (ref. 15), Il10fl/fl (ref. 26), and Rag1−/− (ref. 27)
mice have been described. For Cd1d1ΔIEC mice, please refer to Methods. To activate
Cre recombinase, 4–5-week-old Villin-CreERT2 mice were orally gavaged daily with
1 mg tamoxifen for a total of 5 days. For bone marrow chimaeras, recipients received 
total body irradiation of 1,100 rad in two separate doses 4h apart. The next day 
0.5 X 10° bone marrow cells were delivered intravenously. Oxazolone colitis was 
induced as described previously* and as outlined in Methods. Where indicated, 
mice received 1 mg of neutralizing antibodies against IL-10R (R&D), CD1d (19G11, 
BioXcell) or isotype control intraperitoneally (i.p.), or 5 X 10° adenoviruses in
PBS i.p., each before skin sensitization and before and 2 days after rectal oxazolone 
challenge. For a schematic overview of procedures please refer to Extended Data 
Fig. 1b. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS

Mice. Mice (C57BL/6J unless indicated otherwise) were housed in a specific pathogen-
free (SPF) barrier facility. Cd1d1−/−Cd1d2−/− (ref. 24), TD-CD1d (ref. 17), Il10−/−
(ref. 25), Villin-CreERT2 (ref. 14), Il10fl/fl (ref. 26), Rag1−/− (ref. 27), HSP110-KO
(ref. 23) (C57BL/6N) and Mttpfl/fl (ref. 15) (C57BL/6J/129) mice have been described.
To generate the floxed/neo Cd1d1 allele, a LoxP (L83) site at 10310 and an FNFL
(Frt-Neo-Frt-LoxP) cassette at 12190 were inserted to flank exons 2, 3 and 4 (about
1.9 kb) of the Cd1d1 gene. A gene-targeting vector was constructed by retrieving
one 5 kb long homology arm (5’ to L83), one 1.9 kb sequence containing exons 2/3/4,
the FNFL cassette, and one 2 kb short homology arm (end of FNFL to 3’). The FNFL
cassette conferred G418 resistance during gene targeting in PTL1 (129B6 hybrid)
embryonic stem (ES) cells. The Cd1d1 allele was PCR amplified and sequenced to
confirm the targeted C57BL/6J allele based on the C57BL/6J-specific mutation in
the Cd1d2 allele (ref. 28). Three targeted ES cells with the targeted C57BL/6J allele were injected
into C57BL/6J blastocysts to generate chimaeras. Male chimaeras were bred to
bACTFlpe females or EIIa-Cre females to transmit the floxed Cd1d1 allele (Cd1d1fl/fl)
(with the neo cassette removed by Flpe recombinase) and the Cd1d1-null allele (Cd1d1−/−)
(with exons 2/3/4 and the neo cassette removed by Cre recombinase) through the germ
line. Mice carrying floxed Cd1d1 alleles were crossed to Villin-CreERT2 mice (ref. 14)
expressing Cre recombinase.

To activate Cre recombinase, 4–5-week-old Villin-CreERT2 mice were orally
gavaged daily with 1 mg tamoxifen for a total of 5 days. For bone marrow chimaeras,
recipients received total body irradiation of 1,100 rad in two separate doses 4 h
apart. The next day, 0.5 X 10° bone marrow cells were delivered intravenously. For
α-GalCer studies, mice were injected with 2 µg of α-GalCer (KRN7000, Avanti
Polar Lipids) intraperitoneally (i.p.) 8 weeks after bone marrow reconstitution and
RNA was isolated from Percoll-purified IECs (see later) 12 h after injection. For
oxazolone colitis, mice were pre-sensitized by abdominal skin application of 3% (w/v)
oxazolone in 100% ethanol, followed by rectal administration of 1% oxazolone
in 50% ethanol after 5 days. Where indicated, mice received 1 mg of neutralizing
antibodies against IL-10R (R&D), CD1d (19G11, BioXcell) or isotype control i.p.,
or 5 X 10° adenoviruses in PBS i.p., each before skin sensitization and before and
2 days after rectal oxazolone challenge. Body weight, rectal bleeding and stool con- 
sistency were analysed on a daily basis. Tissues were examined for evidence of colitis 
by five established criteria for colitis activity in a blinded fashion by a pathologist 
(J.G.): hypervascularization, presence of mononuclear cells, epithelial hyperplasia, 
epithelial injury, and presence of granulocytes. For a schematic overview of pro- 
cedures please refer to Extended Data Fig. 1b. 

Animal studies were conducted in a gender- and age-matched manner using
littermates for each experiment. Both male and female mice were used and were
8 weeks of age at the time of experiments (11-16 weeks of age for bone marrow 
chimaera studies). The number of animals used per group was based on previous 
experimental results and observed variability. Histopathology analysis was per- 
formed in a blinded manner. For all other in vitro and in vivo analyses, investiga- 
tors were not blinded to treatment allocation. Animal studies were performed in 
compliance with ethical regulations and were approved by the ethics committees 
of Harvard Medical School and the Christian Albrechts University. 
Patients. Human studies were approved by the ethics committee of the Ludwig 
Maximilians University. All subjects provided written informed consent. Immuno- 
histochemical staining was performed on intestinal biopsies from ten healthy controls 
(patients undergoing screening colonoscopy without detection of malignancy), four 
patients with active ulcerative colitis, and five patients with inactive ulcerative colitis. 
Isolation of IECs and lymphocytes, flow cytometry, and sorting. To isolate IECs, 
the intestines were opened, washed in cold PBS, cut into pieces, and transferred 
into tubes containing HBSS without calcium and magnesium. After the addition of 
dithiothreitol (DTT) at 1 mM (Sigma), intestines were shaken at 250 r.p.m. for
10 min at room temperature to remove mucus. After washing in PBS, tissues were
incubated in RPMI-1640 with 1 U ml−1 Dispase (Roche) for 30–40 min at 37 °C at
250 r.p.m. After filtering through a 100 µm strainer and centrifugation for 5 min at
1,500 r.p.m., the pellet was resuspended in 5 ml of 100% Percoll and placed beneath 
8 ml of 40% Percoll. After centrifugation for 20 min at 1,500 r.p.m. the top layer 
was removed, washed three times in PBS and used as IECs. 

To isolate intraepithelial (IEL) and lamina propria lymphocytes, large intestines 
were collected, fat tissue removed, the intestines cut longitudinally and washed in 
PBS in order to remove faecal content, cut into 30 mm pieces, and shaken in RPMI-
1640 containing 20 mM HEPES, 1 mM DTT and 5 mM EDTA for 30 min at 37 °C.
To isolate IELs, cells derived from the epithelial compartment were resuspended in
4 ml of 40% Percoll and layered on top of 2 ml of 75% Percoll. After centrifugation
for 20 min at 2,000 r.p.m. the middle layer was removed, washed in 2% FBS in PBS
and the IELs were obtained. For lamina propria lymphocytes, the non-epithelial
compartment was washed in PBS, cut into smaller pieces and incubated in RPMI-
1640 containing 5% FBS, 1.5 mg ml−1 collagenase type II and 0.5 mg ml−1 dispase
(GIBCO) for 1 h at 37 °C under constant horizontal shaking (250 r.p.m.). The digested
tissues were filtered through a 40 µm strainer and, after Percoll gradient
centrifugation and washing as described earlier, were used as lamina propria lymphocytes
in flow cytometry. Where indicated, CD11b+ lamina propria cells were purified
using magnetic beads according to the manufacturer’s instructions (Miltenyi Biotec).
Liver lymphocytes were prepared as described before”. 

Primary iNKT cells were sorted by double staining with PBS57-loaded CD1d
tetramers and CD3ε using a BD FACSAria II SORP UV sorter. RNA samples were
prepared using an RNeasy Micro Kit and cDNAs were synthesized using the Omniscript 
RT Kit (Qiagen). 

PBS57-loaded or -unloaded CD1d tetramers were obtained from the NIH Tetramer
Core Facility. Flow cytometry antibodies were obtained from eBiosciences. Flow
cytometry was performed using a MACSQuant (Miltenyi Biotec) and a BD FACSVerse
(BD Biosciences) and data were analysed with FlowJo software (TreeStar).
Antigen presentation assays. Freshly isolated IECs, MODE-K cells”, and CD11c- 
magnetic-bead-purified (Miltenyi Biotec) splenic dendritic cells were incubated 
with 100 ng ml−1 α-GalCer for 4 h, washed three times with PBS, and aliquoted
into 96-well plates at 1 X 10° cells per well. NKT-cell hybridomas (24.7, DN32.D3
and 148.6 (ref. 31)) were added at a 1:1 ratio. Where indicated, soluble, stimulatory
CD28 antibody was added (1 µg ml−1, clone 37.51; eBioscience). In addition, where
indicated, cells were transfected with expression plasmids or siRNA 24 h and 72 h,
respectively, before co-culture with NKT cells. Mouse IL-2, IL-10, IL-12p70, IL-13
and IL-1β production were assessed by ELISA (OptEIA; BD Biosciences) in
supernatants after 24 h of co-culture.

Cell stimulation by crosslinking. CD1d crosslinking on freshly isolated colonic 
IECs was achieved as described previously” using plate-bound anti-mCD1d anti- 
body (1B1, BD). In the case of MODE-K cells, surface CD1d was crosslinked by 
addition of soluble anti-mCD1d antibody (1B1, 10 µg ml−1, 60 min) followed by
washing and addition of secondary goat anti-rat IgG (Biolegend, 10 µg ml−1) for
the indicated time as well as subsequent cell lysis (see later). MODE-K STAT3 
phosphorylation in response to iNKT-cell engagement was investigated after co- 
culture with the iNKT-cell hybridoma 24.7 for the indicated time, removal of non- 
adherent iNKT cells by thorough washing, and cell lysis. Where indicated, MODE-K 
cells were pre-treated with 19G11 anti-CD1d antibody for blocking of CD1d 
(10 µg ml−1, BioXcell).

Adenoviruses. Type 5, E1/E3-deleted adenoviruses expressing murine HSP110 
(AdHSP110) or Escherichia coli β-galactosidase (AdLacZ) under the control of the
cytomegalovirus promoter, were constructed according to the in vitro ligation 
method****, Briefly, murine Hsph1 (NM-013559) was isolated and amplified by PCR 
from mouse intestinal cDNA using the following primers, cloned into NheI and
EcoRI sites of pIRES2-eGFP (Clontech), sequenced, and the resulting plasmid was
called pHSP110-IRES2-eGFP. Forward primer: ATAAAGCTAGCGCCACCatgtc 
ggtgettgeectag, containing an NheI site and a Kozak-like sequence; reverse primer:
AATTAGAATTCTTAgtccaggtccatgttgacagagc, containing an EcoRI site (the sequence 
complementary to Hsph1 cDNA is shown in lower case). An NheI-HSP110-KpnI 
or NheI-HSP110-IRES2-eGFP-NotI fragment from pHSP110-IRES2-eGFP was 
transferred into pHMCMV6 (pHMCMV6-HSP110 and pHMCMV6-HSP110-
IRES2-eGFP, respectively). Subsequently, pAdHM15RGD-CMV6-HSP110 and
pAdHM15RGD-CMV6-HSP110-IRES2-eGFP were constructed by ligating I-CeuI/PI-SceI-digested
pAdHM15-RGD and pHMCMV6-HSP110-IRES2-eGFP or pHMCMV6-HSP110.
To prepare the virus, a PacI fragment of pAdHM15-RGD-CMV-HSP110
or pAdHM15-RGD-CMV-HSP110 was transfected into 293 cells. The
adenoviruses were purified by two rounds of caesium chloride density centrifugation
and dialysed against Tris-HCl (pH 7.5). The concentrations of plaque-forming 
units (p.f.u.) of individual stocks were determined from the tissue culture infec- 
tious dose 50 titre. 
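
The primer design described above (restriction sites and a Kozak-like sequence added as 5' tails) can be sanity-checked with a few lines of code. The following is only a minimal sketch: it looks at the upper-case tails of the two primers as printed in the text, uses the standard recognition sequences for NheI (GCTAGC) and EcoRI (GAATTC), and treats GCCACC as the Kozak-like core. The function and variable names are hypothetical and not part of the original Methods.

```python
# Standard recognition sequences; "Kozak core" is the GCCACC motif placed before ATG.
SITES = {"NheI": "GCTAGC", "EcoRI": "GAATTC", "Kozak core": "GCCACC"}

# Upper-case 5' tails of the primers as printed in the text. The lower-case,
# gene-specific parts are deliberately left out of this check.
FORWARD_TAIL = "ATAAAGCTAGCGCCACC"
REVERSE_TAIL = "AATTAGAATTCTTA"


def find_sites(seq, sites=SITES):
    """Return {motif name: 0-based start index} for every motif present in seq."""
    seq = seq.upper()
    return {name: seq.find(motif) for name, motif in sites.items() if motif in seq}


def reverse_complement(seq):
    """Reverse complement of a DNA sequence (A, C, G, T only)."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seq.upper()))


forward_hits = find_sites(FORWARD_TAIL)
reverse_hits = find_sites(REVERSE_TAIL)
# The last three bases of the reverse tail read as a stop codon on the coding strand.
stop_codon = reverse_complement(REVERSE_TAIL[-3:])

print(forward_hits, reverse_hits, stop_codon)
```

In this sketch the forward tail carries the NheI site followed directly by the Kozak-like core, and the reverse tail carries the EcoRI site followed by the complement of a TAA stop codon, which matches the description of the primers in the text.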

Colon organ culture. Standardized segments (1 cm X 1 cm) of the transverse colon 
were washed in cold PBS supplemented with penicillin and streptomycin (GIBCO) 
and cultured in 24-well flat-bottom culture plates (Falcon) in RPMI 1640 media 
(GIBCO) for 24 h at 37 °C. Supernatants were analysed for cytokines IL-10, IL-13
and IL-1β by ELISA (all BD).

Western blotting. Protein extraction and western blotting were performed as 
described previously**. The following antibodies were used: anti-IL10 (R&D), anti- 
HSP110 (BD), anti-STAT3 (Cell Signaling), anti-pSTAT3 (Cell Signaling) and anti-
β-actin (Cell Signaling).

Real-time RT-PCR. RNA samples were prepared using an RNeasy Mini Kit and 
cDNAs were synthesized using the Omniscript RT Kit (Qiagen). Real-time RT- 
PCR was performed using SYBR Green I Master Mix (Roche) and a CFX96 Real- 
Time System (Bio-Rad). Values were normalized to β-actin. The following primer
sets were used: β-actin, 5'-GATGCTCCCCGGGCTGTATT-3' and
5'-GGGGTACTTCAGGGTCAGGA-3'; Hsph1, 5'-CAGGTACAAACTGATGGTCAACA-3'
and 5'-TGAGGTAAGTTCAGGTGAAGGG-3'; Il10, 5'-GAGAGCTGCAGGGCCCTTTGC-3'
and 5'-CTCCCTGGTTTCTCTTCCCAAGACC-3'; Tgfb, 5'-TGTACGGCAGTGGCTGAACCA-3'
and 5'-TGTCACAAGAGCAGTGAGCGCT-3'; Cd1d1, 5'-GCAGCCAGTACGCTCTTTTC-3'
and 5'-ACAGCTTGTTTCTGGCAGGT-3'; Stat3, 5'-CCCGTACCTGAAGACCAAGT-3'
and 5'-ACACTCCGAGGTCAGATCCA-3'; Il13, 5'-AGCATGGTATGGAGTGTGGACCTG-3'
and 5'-CAGTTGCTTTGTGTAGCTGAGCAG-3'; Ifng, 5'-TCAGCAACAGCAAGGCGAAAAAGG-3'
and 5'-CCACCCCGAATCAGCAGCGA-3'.
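
The text states only that values were normalized to β-actin and does not give the formula used. One common way to do this is the comparative Ct approach, sketched below under the assumption of roughly 100% amplification efficiency; the function names and all Ct values are hypothetical and serve only to illustrate the arithmetic.

```python
def relative_expression(ct_target, ct_reference):
    """Target abundance relative to the reference gene: 2^-(Ct_target - Ct_reference).

    Assumes the product doubles every cycle (about 100% efficiency).
    """
    return 2.0 ** -(ct_target - ct_reference)


def fold_change(ct_target_test, ct_ref_test, ct_target_ctrl, ct_ref_ctrl):
    """Normalized expression in the test sample divided by that in the control."""
    test = relative_expression(ct_target_test, ct_ref_test)
    control = relative_expression(ct_target_ctrl, ct_ref_ctrl)
    return test / control


# Made-up Ct values: the target amplifies two cycles later in the test sample
# while the reference gene is unchanged.
control_level = relative_expression(ct_target=24.0, ct_reference=18.0)
test_vs_control = fold_change(26.0, 18.0, 24.0, 18.0)

print(control_level, test_vs_control)
```

A two-cycle delay at constant reference Ct corresponds to a fourfold reduction in normalized expression, which is the kind of difference reported for the knockdown experiments.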

Transfection, siRNA treatment and recombinant HSP110. Where indicated, 
MODE-K cells were transfected with mouse CD40 or CD1d (in pSRa-neo), with 
SignalSilence Stat3 siRNA II or with SignalSilence Control siRNA (Cell Signaling 
Technology) using Lipofectamine 2000 (Life Technologies) according to the
manufacturer’s instructions. Il10 siRNA was obtained from Invitrogen and transduced
by Amaxa Nucleofector technology (Lonza) according to the manufacturer’s
instructions. Cells were used for antigen presentation assays after 72 h or were incubated
with the indicated adenoviruses after 48 h at a MOI of 5 before RNA preparation 
after another 24h as described earlier. Where indicated, recombinant HSP110 
was added at a final concentration of 6 µg ml−1 to MODE-K cells 24 h before RNA
extraction. 

XBP1 splicing assay. The assay was performed as described previously”®. 

In vivo permeability assay. In vivo intestinal permeability was measured using the 
FITC-labelled dextran (FD-4) method as described previously*’** with
modifications. Mice were administered 150 µl of FD-4 (50 mg ml−1) by oral gavage before
and 18h after rectal challenge with oxazolone. Serial dilutions of FD-4 were made 
to generate a standard curve and serum concentrations of FD-4 were determined 
using a BioTek FLx800 Fluorescence Microplate Reader (BioTek) with an excita- 
tion wavelength of 490 nm and emission wavelength of 530 nm. 
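
The standard-curve step in the permeability assay can be illustrated as follows. This is a minimal sketch that assumes a linear relationship between fluorescence and FD-4 concentration over the range of the standards; the concentrations, readings and function names are invented for illustration and are not taken from the paper.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def concentration_from_reading(reading, slope, intercept):
    """Invert the standard curve to convert a fluorescence reading to a concentration."""
    return (reading - intercept) / slope


# Hypothetical serial dilutions of FD-4 (ug/ml) and their fluorescence readings.
standards = [0.0, 1.25, 2.5, 5.0, 10.0]
readings = [50.0, 300.0, 550.0, 1050.0, 2050.0]

slope, intercept = fit_line(standards, readings)
serum_fd4 = concentration_from_reading(850.0, slope, intercept)

print(slope, intercept, serum_fd4)
```

Readings that fall outside the range of the standards would need to be diluted and re-measured rather than extrapolated.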
Immunohistochemistry and immunofluorescence. A polyclonal rabbit anti- 
HSP110 antibody (anti-mouse 1:250, anti-human 1:500; Sigma-Aldrich), an anti- 
STAT3 (anti-mouse 1:900) and anti-pSTAT3 antibody (anti-mouse 1:350), both 
from Cell Signaling, as well as a rabbit polyclonal anti-GFP antibody conjugated to 
FITC (1:1,000) (Novus Biologicals) were used. All images were taken by a digital 
camera on a Nikon Eclipse Ti microscope. 

Osmium tetroxide staining of intracellular lipid droplets. Intestinal tissue was 
fixed in 10% neutral buffered formalin and transferred into 1% osmium tetroxide 
with periodic shaking. The tissue was rinsed with distilled water and incubated in 
0.5% periodic acid, washed and counterstained with haematoxylin and eosin. 
Microarray analysis. MttpΔIEC and littermate wild-type mice were skin sensitized
with oxazolone, and IECs were harvested via colonic epithelial scraping as described 
previously”, either before administration of rectal oxazolone (n = 2 per group) or 
6 h thereafter (n = 4 per group). Isolated RNAs were pooled for each condition and
subjected to transcriptomic analysis with mouse genome 430 2.0 array (Affymetrix) 
at the Biopolymers Core Facility (Harvard Medical School). Data analysis was 
performed with Agilent GeneSpring GX and Affymetrix GCOS software under 
default parameter settings with GC-RMA normalization. 

Statistical analysis. Statistical testing was performed using the unpaired Student’s 
t-test unless otherwise indicated. The Mann-Whitney U-test was applied when
data were shown not to follow a Gaussian distribution, which is indicated
in the respective figure legends. Comparisons of mortality were made by
analysing Kaplan-Meier survival curves, and the log-rank test was used to assess
differences in survival. All P values were two-tailed, and statistical significance was
assumed at P < 0.05.
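
For readers unfamiliar with the survival analysis mentioned above, the Kaplan-Meier estimate can be sketched in a few lines. The example below is an illustration only: the follow-up times and event flags are invented, the function name is hypothetical, and the log-rank comparison between groups is not implemented here.

```python
def kaplan_meier(times, events):
    """Kaplan-Meier survival estimate.

    times:  follow-up time for each animal (for example, days after challenge)
    events: 1 if the animal died at that time, 0 if it was censored
    Returns a list of (time, survival probability) at each time with a death.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        leaving = 0
        # Group all animals that share the same follow-up time.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            leaving += 1
            i += 1
        if deaths:
            survival *= 1.0 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= leaving
    return curve


# Invented data for six animals: deaths on days 2, 3 and 5; one animal censored
# on day 3 and two censored at the end of observation on day 7.
curve = kaplan_meier([2, 3, 3, 5, 7, 7], [1, 1, 0, 1, 0, 0])
for day, s in curve:
    print(day, round(s, 4))
```

Censored animals leave the risk set without lowering the curve, which is why the estimate after day 5 is 4/9 rather than 3/6.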
Extended Data Figure 1 | Proposed model of CD1d signalling in polarized
intestinal epithelia and overview of experimental procedures. a, The
proposed model of protective (blue) and pathogenic (red) effects of lipid
antigen presentation in intestinal inflammation. Bone-marrow-derived
antigen-presenting cells (APCs) contribute to oxazolone colitis in a CD1d- and
iNKT-cell-dependent manner. By contrast, engagement of intestinal epithelial
CD1d elicits protective functions through cytoplasmic CD1d tail-dependent
activation of STAT3, and STAT3-dependent transcription of Cd1d1, Il10 and
Hsph1. Epithelial IL-10 and HSP110 support this protective self-reinforcing
pathway through STAT3-dependent signalling. Interference with any of the
elements involved in this regulatory pathway (MTP, CD1d, IL-10 or HSP110) is
associated with uncontrolled intestinal inflammation, thus highlighting a critical 
role of this pathway in the control of intestinal inflammation. Conv., 
conventional. b, Overview of experimental procedures. 
Extended Data Figure 2 | Absence of enterocyte lipid accumulation and ER
stress in MttpΔIEC mice. a, Absence of lipid accumulation in MttpΔIEC mice as
shown in representative haematoxylin and eosin and rare osmium tetroxide
(arrows) staining in the caecum, proximal, and distal colon. Scale bar, 25 µm.
b, Xbp1 splicing in small intestinal (SI) and colonic IECs of the indicated mice
before (left) (n = 5 mice per group) and 6 h after rectal challenge with
oxazolone (Ox., right) (n = 5 mice per group). MODE-K cells were treated with
either thapsigargin (Tg) or vehicle (DMSO) as positive and negative controls, 
respectively. Results representative of three independent experiments are 
shown. 
Extended Data Figure 3 | Impaired CD1d- but not MHC-class I-restricted
antigen presentation in MttpΔIEC mice. a, CD1d-mediated presentation of
the exogenous lipid antigen α-GalCer to the invariant NKT-cell hybridomas
24.7 and DN32.D3 and of endogenous lipid antigens (autoreactivity) to the
non-invariant NKT-cell hybridoma 148.6 by IECs from CD1d-knockout
(CD1d-KO) mice, MttpΔIEC mice and wild-type littermates (Mttpfl/fl) (n = 5
mice per group). b, Presentation of H2-Kb-restricted SIINFEKL by the
indicated IECs (see a) to the SIINFEKL-responsive hybridoma RF33.70.
c, Representative histograms of CD1d cell surface expression as determined
by flow cytometry of colonic IECs of the indicated mouse strains before
(left, n = 5 mice per group) and 6 h after (right, n = 5 mice per group) rectal
oxazolone challenge. d, Representative haematoxylin and eosin stainings of
the indicated mouse strains upon rectal challenge with oxazolone (Ox.) or
vehicle (ethanol (EtOH)). Scale bar, 40 µm. Results representative of three
independent experiments are shown. Mean ± s.e.m. of triplicate cultures are
shown. Student’s t-test was applied.
Extended Data Figure 4 | Increased morbidity and mortality in
oxazolone-challenged MttpΔIEC mice is due to CD1d-restricted components
of the adaptive immune system. a, Intestinal permeability as determined
by FD-4 before and 18 h after rectal challenge with oxazolone (Ox.) in the
indicated mouse strains. Each symbol represents a single mouse. b, IL-1β
secretion by CD11b+ cells from colonic lamina propria of the indicated bone
marrow chimaeras. CD11b+ cells were isolated using magnetic microbeads 24 h
after rectal challenge with oxazolone or ethanol and cultured for 24 h before
measurement of IL-1β in culture supernatants by ELISA. Mean ± s.e.m. of
triplicate cultures are shown. c, d, Survival and body weight of the indicated
mouse strains at the indicated days after rectal oxazolone challenge. Results
representative of three independent experiments are shown. Mean ± s.e.m.
of the indicated number of mice are shown. Student’s t-test was applied.
Extended Data Figure 5 | Development and characterization of mice with
IEC-specific Cd1d1 deletion. a, Schematic map of the targeting strategy for
generation of Cd1d1fl/fl mice. A LoxP (L83) site was inserted at 10310 and an
FNFL (Frt-Neo-Frt-LoxP) cassette at 12190 to flank exons 2, 3, and 4 (about
1.9 kb) of the Cd1d1 gene to generate the ‘floxed/neo’ Cd1d1 allele. A gene-
targeting vector was constructed by retrieving one 5 kb long homology arm
(5’ to L83), one 1.9 kb sequence containing exons 2/3/4, the FNFL cassette, and one
2 kb short homology arm (end of FNFL to 3’). The FNFL cassette conferred
G418 resistance during gene targeting in PTL1 (129B6 hybrid) embryonic stem
(ES) cells. The targeted Cd1d1 allele was PCR amplified and sequenced to
confirm the targeted C57BL/6 allele based on the C57BL/6-specific mutation in
the Cd1d2 allele. Three targeted ES cells with the targeted C57BL/6 allele were
injected into C57BL/6 blastocysts to generate chimaeric mice. Male chimaeras
were bred to bACTFlpe females or EIIa-Cre females to transmit the floxed
Cd1d1 allele (Cd1d1fl/fl) (with the neo cassette removed by Flpe recombinase)
through the germ line. Mice carrying the floxed Cd1d1 allele were crossed to
tissue-specific Villin-CreERT2 mice, and exons 2, 3 and 4 and Frt-LoxP were removed
by Cre recombinase in intestinal epithelial cells. b–e, Impaired CD1d- but not
MHC-class I-restricted antigen presentation in Cd1d1ΔIEC mice. b, Cd1d1ΔIEC
mice exhibit normal numbers of invariant NKT cells in liver, spleen and colonic
lamina propria as determined by flow cytometry using α-GalCer/CD1d
tetramers (n = 4 mice per group). c, Representative histograms of CD1d cell
surface expression on colonic IECs of the indicated mouse strains before (left,
n = 5 mice per group) and 6 h after (right, n = 5 mice per group) rectal oxazolone
challenge. d, CD1d-mediated antigen presentation by colonic IECs of the
indicated mouse strains (n = 5 mice per group). Presentation of the model
glycolipid antigen α-GalCer to invariant NKT-cell hybridomas 24.7 and
DN32.D3 and of endogenous antigens (autoreactivity) to the non-invariant
NKT-cell hybridoma 148.6 is shown. e, Presentation of H2-Kb-restricted
SIINFEKL by the indicated IECs (n = 5 mice per group) to the
SIINFEKL-responsive hybridoma RF33.70. Results representative of three
independent experiments are shown. b–e, Mean ± s.e.m. of triplicate cultures
are shown. Student’s t-test was applied. f, Intestinal epithelial CD1d is
protective in oxazolone colitis. Representative macroscopic colon images of
Cd1d1ΔIEC and wild-type littermates upon rectal challenge with oxazolone
(Ox., n = 5 mice per group) or vehicle (ethanol (EtOH), n = 4 mice per group).


©2014 Macmillan Publishers Limited. All rights reserved 


Extended Data Figure 6 | Epithelial HSP110 expression is decreased
in MttpΔIEC mice and in human ulcerative colitis. a, HSP110
immunohistochemistry in mice with IEC Mttp deletion (bottom) as compared
with wild-type littermates (top) 6 h after rectal oxazolone challenge. b, HSP110
immunohistochemistry in the colonic intestinal epithelium and the lamina
propria of healthy controls and patients with active and inactive ulcerative
colitis. Signal intensity was scored on a scale from 0 to 3 in a blinded fashion.
Each symbol represents a single patient. The median and significance level
as determined by the Mann-Whitney U-test are shown. c, Mortality in
conventional HSP110-knockout mice in the oxazolone colitis model. Results
representative of three independent experiments are shown. Mean ± s.e.m. of
the indicated number of mice is shown. Student’s t-test and the log-rank test
(survival) were applied. NS, not significant.
Extended Data Figure 7 | Purity of isolated IECs. After isolation of colonic
IECs, potential contamination with haematopoietic cells was investigated by
flow cytometry. The major population of cells was gated (upper left), was shown
to contain viable cells (upper middle), and to contain largely EpCAM-positive
epithelial cells that do not stain with leukocyte markers (middle and bottom).
The upper right panel shows isotype control staining. Results representative of
three independent experiments (n = 5 mice per experiment) are shown.
Extended Data Figure 8 | CD1d-dependent STAT3 phosphorylation upon
co-culture of IECs and iNKT cells. a, STAT3 phosphorylation in IECs upon
co-culture with iNKT cells is CD1d-dependent. Expression of pSTAT3, STAT3
and β-actin as determined by western blotting of MODE-K cells at the
indicated time after addition of the iNKT-cell hybridoma 24.7. Where
indicated, MODE-K cells were pre-incubated with a monoclonal blocking
antibody directed against CD1d (19G11; +) or an isotype control (−).
b, siRNA-mediated knockdown of Stat3 in MODE-K cells as determined
68 h after siRNA transfection. Results representative of three independent
experiments are shown. Means ± s.e.m. of triplicates are shown. Student’s t-test
was applied.
Extended Data Figure 9 | CD1d-mediated cytokine production is
dependent on the APC and expression of co-stimulatory molecules. a–f, IL-10
(a, c–f) and IL-12p70 (b) secretion of co-cultures of the IEC line MODE-K or
splenic dendritic cells (DCs) together with the iNKT-cell hybridoma 24.7.
APCs were loaded with α-GalCer (αGC; 100 ng ml−1) before washing and
co-culture with iNKT cells. c, MODE-K cells (IEC) were transfected with
control siRNA or siRNA directed against Il10 before co-culture with iNKT cells.
d, f, MODE-K cells were transfected with CD40 (d) and CD1d (f) as indicated.
Histograms demonstrated increased expression of CD40 (d) and CD1d (f) after
transfection. e, Stimulatory anti-CD28 antibody was added during co-culture
of MODE-K and iNKT cells. Means ± s.e.m. of quadruplicate cultures are
shown. Student’s t-test was applied. Results are representative of two
independent experiments.
Extended Data Table 1 | Genes downregulated in expression in oxazolone-challenged MttpΔIEC mice

Fold change*   Gene ID     Description
8.721          Ifi202b     interferon activated gene 202B
3.878          Eya3        eyes absent 3 homolog (Drosophila)
3.7            Sipa1l1     signal-induced proliferation-associated 1 like 1
3.661          Gna14       AV230778 RIKEN full-length enriched, 0 day neonate skin Mus musculus cDNA clone 4632401H08 3', mRNA sequence.
3.534          Cd79b       CD79B antigen
3.347          Mapk14      mitogen activated protein kinase 14
3.347          Aasdhppt    aminoadipate-semialdehyde dehydrogenase-phosphopantetheinyl transferase
3.259          Marcks      myristoylated alanine rich protein kinase C substrate
3.216          Tgtp        T-cell specific GTPase
3.216          Rbmx        RNA binding motif protein, X chromosome
3.178          Tcl1b3      T-cell leukemia/lymphoma 1B, 3
3.162          Clu         clusterin
3.134          Indo        indoleamine-pyrrole 2,3 dioxygenase
3.12           Ncl         uy94h11.x1 NCI_CGAP_Mam5 Mus musculus cDNA clone IMAGE:3667269 3' similar to SW:NUCL_MOUSE P09405 NUCLEOLIN; mRNA sequence.
3.101          Hspa4       heat shock protein 4
3.068          Gbp2        guanylate nucleotide binding protein 2
3.057          Ncbp2       nuclear cap binding protein subunit 2
3.034          Mcm5        minichromosome maintenance deficient 5, cell division cycle 46 (S. cerevisiae)
3.024          Hsph1       heat shock protein 110

* Fold change: wild-type littermates relative to MttpΔIEC mice.

Genes downregulated by more than threefold in intestinal mucosal scrapings of MttpΔIEC mice compared with wild-type littermates 6 h after rectal challenge with oxazolone.
Mfsd2a is a transporter for the essential omega-3
fatty acid docosahexaenoic acid


Long N. Nguyen, Dongliang Ma, Guanghou Shui, Peiyan Wong, Amaury Cazenave-Gassiot, Xiaodong Zhang,
Markus R. Wenk, Eyleen L. K. Goh & David L. Silver


Docosahexaenoic acid (DHA) is an omega-3 fatty acid that is essential
for normal brain growth and cognitive function. Consistent
with its importance in the brain, DHA is highly enriched in brain
phospholipids. Despite being an abundant fatty acid in brain
phospholipids, DHA cannot be synthesized de novo in the brain and must be
imported across the blood-brain barrier, but mechanisms for DHA
uptake in brain have remained enigmatic. Here we identify a member
of the major facilitator superfamily—Mfsd2a (previously an orphan 
transporter)—as the major transporter for DHA uptake into brain. 
Mfsd2a is found to be expressed exclusively in endothelium of the 
blood-brain barrier of micro-vessels. Lipidomic analysis indicates 
that Mfsd2a-deficient (Mfsd2a-knockout) mice show markedly reduced 
levels of DHA in brain accompanied by neuronal cell loss in hippo- 
campus and cerebellum, as well as cognitive deficits and severe anx- 
iety, and microcephaly. Unexpectedly, cell-based studies indicate 
that Mfsd2a transports DHA in the form of lysophosphatidylcho- 
line (LPC), but not unesterified fatty acid, in a sodium-dependent 
manner. Notably, Mfsd2a transports common plasma LPCs carrying
long-chain fatty acids such as LPC oleate and LPC palmitate, but
not LPCs with an acyl chain shorter than 14 carbons. Moreover, we
determine that the phospho-zwitterionic headgroup of LPC is critical
for transport. Importantly, Mfsd2a-knockout mice have markedly
reduced uptake of labelled LPC DHA, and other LPCs, from plasma 
into brain, demonstrating that Mfsd2a is required for brain uptake 
of DHA. Our findings reveal an unexpected essential physiological 
role of plasma-derived LPCs in brain growth and function. 
Immunolocalization of Mfsd2a indicates that it is highly enriched in
brain micro-vessels, where it is exclusively found in endothelium
constituting the blood-brain barrier (BBB) (Fig. 1a, b, c, and Extended Data
Fig. 1a, b), but not expressed in pericytes that enwrap the endothelium
(Extended Data Fig. 1c, d). This localization pattern in the BBB was also
noted in monkey (Extended Data Fig. 1e-g). Mfsd2a was found to be
expressed in the BBB at embryonic day 15.5 (E15.5) (Extended Data Fig. 2a).
This pattern of localization suggests a transport function in the BBB.
Male and female Mfsd2a-deficient (Mfsd2a-knockout) mice were born
at Mendelian ratios, but had significantly increased postnatal mortality
early in life. Consistent with this, fetal and placental weights of
knockout mice at E18.5 were similar to wild-type littermates (Extended
Data Fig. 2b). In addition, knockout mice after weaning exhibited motor
dysfunction with front-paw clasping during tail suspension (Extended
Data Fig. 2c). Brain size and weight were also significantly lower in
knockout mice than in wild-type littermates (Extended Data
Fig. 3a, b). However, there were no visible differences in gross anatomy
of brains from knockout and wild-type mice (Extended Data Fig. 3c, d).
Interestingly, cerebellum of knockout mice exhibited a significant loss 
of Purkinje cells (Fig. 1d, e). Furthermore, there was a significant decrease 
in neuronal cell density in the hippocampus, particularly in the CA1 and 
CA3 regions, but normal cell density in the dentate gyrus of knockout 
mice (Fig. 1f, g). Behavioural tests indicated that knockout mice had 




Figure 1 | Localization of Mfsd2a in the blood-brain barrier and neuronal 
deficits in Mfsd2a-knockout mice. a, Mfsd2a (red) is highly enriched in brain 
micro-vessels, shown here in CA1 region with NeuN staining (green) of mature 
neurons. b, c, Mfsd2a is expressed in endothelial cells of the blood-brain 
barrier in close contact with endfeet of astrocytes as shown by GFAP staining. 
d, Loss of Purkinje cells detected by parvalbumin (Pvalb) staining in cerebellum 
of knockout (KO) mice. e, Quantification of Purkinje cells in the cerebellum 
of wild-type (WT) and knockout mice. f, NeuN staining in hippocampus of 
sagittal brain sections of 8-week-old wild-type and knockout mice indicates 
decreased mature neurons in specific hippocampal regions of knockout mice. 
g, Quantification of neuron numbers in CA1, CA3 and dentate gyrus (DG) 
regions from mice examined in f above. ***P < 0.001. Data are expressed as
mean ± s.e.m.
Figure 2 | Brains of Mfsd2a-knockout mice are DHA-deficient.
A comprehensive lipidomic analysis of brain, liver and heart phospholipids of
adult Mfsd2a-knockout and wild-type mice. a, Heatmap representation of
percentage of individual phospholipid species measured from brain, liver
and heart. DHA- and arachidonic acid (AA)-containing species are
highlighted in red and blue, respectively. e, ether; p, plasmalogen; LysoPC,
lysophosphatidylcholine; LysoPI, lysophosphatidylinositol; LysoPE,
lysophosphatidylethanolamine; PC, phosphatidylcholine; PE,
phosphatidylethanolamine; PI, phosphatidylinositol; PS, phosphatidylserine.
b-g, Total DHA levels in brain, liver and heart phospholipids (b, d, f; wild-type,
n = 5; knockout, n = 4), and total AA levels in brain, liver and heart
(c, e, g). ***P < 0.0001. DHA and AA levels are expressed as mean ± s.e.m. of
the percentage of the total level of phospholipids (see the Source Data
associated with this figure for the full data set).


severe deficits in learning, and short- and long-term memory, as well 
as severe anxiety (Extended Data Fig. 4). 

The cognitive and behavioural dysfunction of Mfsd2a-knockout mice
is reminiscent of omega-3 fatty-acid deficiency. We performed a
comprehensive lipidomic analysis of brain, liver and heart by mass
spectrometry on wild-type and knockout mice. Notably, we found that DHA, but
not other fatty-acid species, in the major phospholipid classes
phosphatidylethanolamine (PE), phosphatidylcholine (PC),
phosphatidylinositol (PI) and phosphatidylserine (PS) was significantly
decreased in the brain of knockout mice compared to wild-type mice (Fig. 2a).
DHA is found mainly as phospholipid species 38:6 and 40:6 in PE, PC, PI and
PS (Fig. 2a). The total level of DHA-containing species was not
significantly different in the liver and heart of knockout mice compared to
wild-type mice (Fig. 2d, f). However, the total level of DHA in brain of
knockout mice was reduced by 58.8% with minor changes in other fatty-acid
species (Fig. 2b). Brains of knockout mice had a 33.8% increase in
arachidonic acid in phospholipids (Fig. 2c), which is commonly increased
in rodent models of DHA deficiency. Arachidonic acid levels in liver
and heart were unchanged (Fig. 2e, g). It is noteworthy that knockout
mice were raised on a DHA-sufficient diet, underscoring a physiological
role of Mfsd2a in maintaining brain DHA levels. Despite the apparent
lack of anatomical changes in the developing brains of Mfsd2a-knockout
embryos, biochemical changes in brain phospholipids were apparent at
E18.5 with significantly reduced levels of DHA in phospholipids (Extended
Data Fig. 2d), indicating that Mfsd2a plays an essential role in
maintaining DHA levels during embryogenesis.

The brain selectively accumulates a high amount of plasma-derived
DHA. In plasma, the exchangeable pool of DHA is bound to albumin
as either unesterified fatty acid or LPC. Using cell-based assays,
we found that Mfsd2a did not transport unesterified DHA, or other
unesterified fatty acids (Extended Data Fig. 5a, b, and Supplementary Table 1).
Remarkably, cells expressing Mfsd2a exhibited an enhanced
concentration-dependent uptake of LPC [¹⁴C]DHA relative to control cells (Fig. 3a),
indicating that Mfsd2a is indeed an LPC DHA transporter. Alanine
mutagenesis of the phylogenetically conserved residues aspartate 92 (Asp92Ala)
and 96 (Asp96Ala), which are critical for sodium binding in MFS proteins,
resulted in reduced transport and absence of transport, respectively (Fig. 3a).
The Asp92Ala and Asp96Ala mutants were expressed at levels similar to wild
type (Extended Data Fig. 5c, d). We next examined transport specificity of
Mfsd2a for the most common LPCs found in plasma, LPC oleate and LPC palmitate.
Cells expressing wild-type Mfsd2a showed concentration-dependent uptake
of LPC [¹⁴C]oleate and LPC [³H]palmitate (Fig. 3b, c). Human Mfsd2a
also transported LPCs in a similar manner (Extended Data Fig. 5e-h).
Mfsd2a had the highest capacity to transport LPC DHA, followed by LPC
oleate and LPC palmitate (Fig. 3d). Moreover, cells expressing wild-type
Mfsd2a and the partially active Asp92Ala mutant exhibited saturation kinetics
over time (Extended Data Fig. 5i). Importantly, Mfsd2a-dependent transport
of LPC [¹⁴C]DHA and LPC [¹⁴C]oleate resulted in their rapid conversion
into phosphatidylcholines (Fig. 3e-h). Moreover, transport by Mfsd2a
resulted in increases in phosphatidylcholine mass, indicating net uptake
of LPC into cells (Extended Data Fig. 5j).

LPC transport by Mfsd2a was sodium-dependent (Fig. 3i). Consistent 
with the characteristics of other sodium-dependent MFS symporters, 
Figure 3 | Cell-based transport assays of radiolabelled LPCs.
a-c, Concentration-dependent transport of LPC [¹⁴C]DHA, LPC [¹⁴C]oleate
and LPC [³H]palmitate after 30 min. Mouse Mfsd2a (wild-type) and mutant
constructs D92A and D96A were tested for uptake of radiolabelled LPCs at
indicated concentrations. d, Comparison of transport preference of LPC
[¹⁴C]DHA, LPC [¹⁴C]oleate and LPC [³H]palmitate. e, Biological
incorporation of radiolabelled LPC [¹⁴C]DHA. f, Quantification of PC bands
from thin-layer chromatography (TLC) plates shown in e. g, LPC [¹⁴C]oleate
bio-incorporation into phosphatidylcholine. h, Quantification of PC bands
from TLC plates shown in g. i, Transport activity of mouse Mfsd2a is dependent
on sodium. Ch⁺, choline. j, Dose-response curve for sodium-concentration
dependency of transport of 50 µM LPC [¹⁴C]oleate by Mfsd2a. ***P < 0.0001.
For a-d, f, h-j, data in triplicates are expressed as mean ± s.e.m.


the transport is highly sensitive to low sodium concentrations, indicating
high affinity for sodium (Fig. 3j). LPC transport was not dependent
on pH or lithium (Extended Data Fig. 5k, l). In addition, LPC
[³H]16:0 solubilized either in ethanol or in micelles was transported by
Mfsd2a, albeit with lesser capacity compared with the bovine serum albumin
(BSA)-bound form, indicating that albumin is not essential for transport
(Extended Data Fig. 5m). Our findings reveal Mfsd2a as the first facilitative
transporter identified for the transport of phospholipids.

We next sought to determine ligand specificity and the chemical features
of the LPC ligands that are required for transport by Mfsd2a. To
this end, we set up competition assays using cells expressing
Mfsd2a or mock control cells, and treated them with 25 µM LPC
[³H]palmitate in the presence or absence of a tenfold molar excess of
unlabelled competitor. We found that an LPC with a minimum acyl chain
length of 14 carbons can effectively compete for uptake (Extended Data
Fig. 6a-c). The lysophospholipids LPE and LPS showed weak competition,
whereas LPA was non-competitive for the transport of LPC
[³H]palmitate (Extended Data Fig. 6c). We confirmed that Mfsd2a can
directly transport LPE using fluorescent LPE (Extended Data Fig. 7).
Figure 4 | Uptake of radiolabelled LPCs by brain is decreased in
Mfsd2a-knockout mice. a, b, Male mice of 6-7 weeks of age were injected
intravenously with the same dose of LPC [¹⁴C]DHA (a) and LPC [¹⁴C]oleate
(b). Brain, liver and heart were collected 2 h post injection for lipid extraction
and DPM was quantified using scintillation counting. Uptake is expressed
as mean ± s.e.m. (wild-type, n = 5; knockout, n = 5). c, Structure of
fluorescent NBD LPC. d, HEK293 cells expressing wild-type Mfsd2a showed
significantly enhanced uptake activity to NBD LPC compared with mock
(empty plasmid), D92A and D96A mutant. e, TLC analysis showed that NBD
LPC was bio-incorporated into PC. Transport of NBD LPC was suppressed
by tenfold molar excess of LPC 18:0 (cold). Mutants D92A and D96A had similar
transport activity for NBD LPC as mock-transfected cells. f, Quantification
of PC band from TLC plates shown in e. g, Brain uptake of NBD LPC was
decreased in knockout mice. Male mice (wild-type, n = 3; knockout, n = 3)
aged 6 weeks were intravenously injected with 300 µg NBD LPC-BSA
complex. h, Fluorescence from 15 brain sections of wild-type and knockout
mice was quantified and expressed as fluorescence intensity per pixel.
***P < 0.0001, **P < 0.001, *P < 0.05. Data are expressed as mean ± s.e.m.
These results indicate that the zwitterionic charge of the
phosphatidylcholine headgroup is critical for ligand transport. Furthermore,
glycerophosphatidylcholine was not a competitor (Extended Data Fig. 6c),
supporting the conclusion that a long acyl chain of the LPC is a requisite 
for ligand transport. Lysoplasmalogen, lysoplatelet activating factor, and 
platelet activating factor were strong competitors, indicating that the car- 
bonyl group of the acyl chain of LPC is not required for transport (Extended 
Data Fig. 6d-f). Lysosphingomyelin also competed for LPC [³H]palmitate
transport, indicating that the glycerol backbone is not required for ligand 
function (Extended Data Fig. 6f). Similar to LPA, sphingosine-1-phosphate 
was not a competitor (Extended Data Fig. 6f), further supporting the con- 
clusion that the choline portion of the phosphocholine headgroup is essen- 
tial for ligand transport. Using lysolipid-like detergents we confirmed 
that an alkyl side chain of a minimal length of 14 carbons and the phos- 
phocholine headgroup, but not the glycerol backbone and carbonyl group, 
are essential chemical features of the LPC ligand (Extended Data Fig. 6g, h). 

Remarkably, Mfsd2a-knockout mice had a reduction in brain uptake
of LPC [¹⁴C]DHA and LPC [¹⁴C]oleate by 90% and 80%, respectively
(Fig. 4a). Uptake of LPC [¹⁴C]DHA or LPC [¹⁴C]oleate in peripheral
tissues of knockout mice was not decreased (Fig. 4a, b). Brain uptake of
unesterified [¹⁴C]DHA was significantly lower relative to LPC [¹⁴C]DHA
uptake, but was not reduced in knockout relative to wild-type mice
(Extended Data Fig. 8), indicating brain accretion of DHA is mainly
through LPC DHA uptake via Mfsd2a.

In a separate approach to examining in vivo transport of LPCs by 
Mfsd2a, we tested the transport of nitrobenzoxadiazole (NBD) LPC, a
fluorescent LPC analogue (Fig. 4c). Similar to native LPCs, NBD LPC 
was transported by Mfsd2a and bio-incorporated into phospholipids 
(Fig. 4d-f). Intravenous injection of NBD LPC into knockout and wild- 
type mice resulted in a significant reduction in the fluorescence in knock- 
out brain sections (Fig. 4g, h). Similar findings were obtained using 
TopFluor LPC, a fluorophore that is structurally different to NBD LPC 
(Extended Data Fig. 9). 

Dietary DHA treatment did not rescue knockout phenotypes, such 
as brain size and anxiety (Extended Data Fig. 10a-e). Lack of rescue can
be explained by a dramatic reduction in embryonic brain uptake of die- 
tary DHA at E18.5 (Extended Data Fig. 10f), consistent with reduced 
embryonic brain DHA levels (Extended Data Fig. 2d). 

Taken together, the current study identifies the orphan transporter
Mfsd2a as a sodium-dependent LPC symporter that represents the major
mechanism by which DHA enters the brain, and indicates for the first
time an important physiological role of plasma-derived LPCs for normal
brain growth and function. Given these findings, we propose to rename
Mfsd2a to sodium-dependent LPC symporter 1 (NLS1).


METHODS SUMMARY 

Animals. Mfsd2a-knockout mice were generated as described previously.
Experimental protocols were approved by the SingHealth Institutional Animal Care and
Use Committee. 

Lipidomic analysis. Phospholipid species were measured by high performance liquid 
chromatography (HPLC) 1100 system (Agilent) coupled with a Triple Quadrupole 
Ion Trap mass spectrometer (4000Qtrap, Applied Biosystem). Levels of individual 
phospholipid species were analysed as described previously.

In vitro transport of radiolabelled and fluorescent LPCs. Uptake of
radiolabelled or fluorescent LPCs was assayed using HEK293 cells overexpressing
Mfsd2a and mutant constructs. For the sodium-dependence assay, radiolabelled LPCs
were diluted in transport buffer (5 mM KCl, 10 mM Hepes, pH 7.4) with 150 mM
NaCl or 150 mM choline chloride.

In vivo transport of radiolabelled, fluorescent LPCs and unesterified DHA.
Male and female mice were intravenously injected with 1 µmol of radiolabelled
LPC-BSA complex. At 2 h post injection, mice were perfused with PBS containing 0.5%
BSA. Tissues were collected for lipid extraction, and scintillation counted. Similar
experiments were also performed with an injection of 300 µg NBD LPC-BSA complex
or 300 µg TopFluor LPC-BSA complex per mouse. To study fetal transport, pregnant
females at E18.5 were gavaged with a bolus of [¹⁴C]DHA-BSA complex. Brains of fetuses
were collected 20 h post gavage and weighed, and lipid extracts were scintillation counted.
Statistical analysis. Statistical differences of DHA and arachidonic acid (AA) levels,
and histological analyses between genotypes were calculated using an unpaired Student’s
t-test. Statistical analysis of LPC [¹⁴C]DHA, LPC [³H]palmitate and LPC [¹⁴C]oleate
signals was performed using two-way analysis of variance (ANOVA); P < 0.05 was
considered to be significant. For the behavioural tests, a one-way ANOVA was used
to analyse the zero maze, light-dark box, open field and Y maze. A two-way ANOVA
was used to analyse the novel object test. Bonferroni-corrected pair-wise comparisons
were used as post-hoc tests. Data were expressed as mean ± s.e.m. and P < 0.05
was considered statistically significant.
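
As an illustration of the summary statistics named above, the following minimal sketch computes mean ± s.e.m., an unpaired Student's t statistic with pooled variance, and a Bonferroni adjustment. It is not the authors' analysis code. The function names and the example values are hypothetical; only the group sizes (n = 5 and n = 4) echo those quoted for Fig. 2.

```python
import math
import statistics


def mean_sem(values):
    """Return (mean, standard error of the mean) for one group."""
    n = len(values)
    return statistics.mean(values), statistics.stdev(values) / math.sqrt(n)


def student_t(group_a, group_b):
    """Unpaired Student's t statistic (pooled variance) and degrees of freedom."""
    na, nb = len(group_a), len(group_b)
    var_a = statistics.variance(group_a)
    var_b = statistics.variance(group_b)
    df = na + nb - 2
    pooled = ((na - 1) * var_a + (nb - 1) * var_b) / df
    se = math.sqrt(pooled * (1.0 / na + 1.0 / nb))
    t = (statistics.mean(group_a) - statistics.mean(group_b)) / se
    return t, df


def bonferroni(p_values):
    """Multiply each P value by the number of comparisons, capped at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


# Hypothetical measurements, for illustration only (not data from the paper).
wild_type = [10.0, 12.0, 11.0, 13.0, 9.0]
knockout = [5.0, 4.0, 6.0, 5.0]

wt_mean, wt_sem = mean_sem(wild_type)
t_stat, dof = student_t(wild_type, knockout)
adjusted = bonferroni([0.01, 0.04, 0.5])
```

Turning the t statistic into a P value requires the t distribution; a statistics library such as SciPy (`scipy.stats.ttest_ind`) would normally be used for that step.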


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
Chemicals. Non-radiolabelled lysophosphatidylcholines,
lysophosphatidylethanolamine, lysophosphatidylserine and other lysophospholipids
were purchased from Avanti Polar Lipids. Radiolabelled 1-palmitoyl
2-lysophosphocholine (LPC [³H]palmitate), 1-oleoyl 2-lysophosphocholine
(LPC [¹⁴C]oleate), 1-docosahexaenoyl 2-lysophosphocholine (LPC [¹⁴C]DHA), and
[1-¹⁴C]docosahexaenoic acid ([¹⁴C]DHA) were purchased from American
Radiochemicals. Lysophospholipids either in chloroform (non-labelled) or
ethanol:toluene (radiolabelled) were completely dried under nitrogen gas and
were solubilized in 12% fatty acid-free BSA (Sigma), which was dissolved in
150 mM NaCl. To prepare the LPC [¹⁴C]oleate and LPC oleate mixture, 25 µCi LPC
[¹⁴C]oleate (specific activity 55 mCi mmol⁻¹) was dried and dissolved in 3 ml
of 20 mM non-labelled LPC stearate-BSA. The LPC [³H]palmitate and LPC palmitate
mixture was prepared by dissolving 25 µCi LPC [³H]palmitate (specific activity
60 Ci mmol⁻¹) in 4 ml of 10 mM LPC palmitate. 1-docosahexaenoyl LPC
was prepared by hydrolysis of 1,2-didocosahexaenoyl PC with honey bee venom
PLA2 (Sigma) in borax buffer (pH 8.5) containing 5 mM CaCl2 and purified by
TLC. To prepare the LPC [¹⁴C]DHA and LPC DHA mixture, 10 µCi LPC
[¹⁴C]DHA was mixed with non-labelled LPC DHA-BSA complex to a final
concentration of 10 mM. To prepare the [¹⁴C]DHA and DHA mixture, 50 µCi
[¹⁴C]DHA was mixed with non-labelled DHA-BSA complex to a final concentration
of 12.2 mM. DHA oil (HiDHA) was a gift from NuMega.
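
The tracer mixtures above fix a specific activity that is later needed to convert scintillation counts (DPM) into moles of lipid. The sketch below works through that arithmetic for the LPC [14C]oleate mixture (25 µCi at 55 mCi mmol⁻¹, dissolved in 3 ml of 20 mM carrier). The function names are hypothetical and the conversion step is an assumption about how such counts are typically interpreted, not a procedure stated in the text; the factor 1 µCi = 2.22 × 10⁶ DPM is the standard definition.

```python
DPM_PER_MICROCURIE = 2.22e6  # 1 µCi = 3.7e4 Bq = 2.22e6 disintegrations per minute


def tracer_mixture(label_uci, label_mci_per_mmol, carrier_mm, carrier_ml):
    """Return (total µmol of lipid, DPM per nmol) for a label + carrier mixture.

    mCi/mmol is numerically equal to µCi/µmol, and mM x ml gives µmol.
    """
    labelled_umol = label_uci / label_mci_per_mmol
    carrier_umol = carrier_mm * carrier_ml
    total_umol = labelled_umol + carrier_umol
    dpm_per_nmol = label_uci * DPM_PER_MICROCURIE / (total_umol * 1000.0)
    return total_umol, dpm_per_nmol


def dpm_to_nmol(dpm, dpm_per_nmol):
    """Convert a scintillation count to nmol of lipid (hypothetical helper)."""
    return dpm / dpm_per_nmol


# Values quoted in the text for the LPC [14C]oleate mixture.
total_umol, dpm_per_nmol = tracer_mixture(25.0, 55.0, 20.0, 3.0)
```

Under these figures the labelled lipid contributes under 1% of the total moles, so the carrier concentration dominates the final specific activity.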

Animals. Mfsd2a-knockout mice were generated as described previously. Mice were
maintained on a high-energy diet 5LJ5 (PicoLab) containing a total of 11% fat. Pups
were weaned at 3 weeks old and maintained on the high-energy diet. Experimental
protocols were approved by the SingHealth Institutional Animal Care and Use
Committee. Mice in all experiments were age- and gender-matched, and not randomized
into groups. Experiments involving mice were not carried out with the experimenter
blinded. Sample sizes used in animal experiments were based on empirical
determination in preliminary experiments.

Lipidomic analysis. For tissue lipid analysis, brain, liver and heart of female
wild-type and knockout mice aged 7-8 weeks born to Mfsd2a heterozygous (HET) mothers,
and brains from E18.5 embryos were collected and immediately frozen in liquid
nitrogen until extraction. Lipids were extracted by the chloroform-methanol
method. For cell-based free fatty-acid transport, lipids from HEK293 cells
expressing mouse Mfsd2a and mock controls after overnight treatment with 100 µM
docosahexaenoic, arachidonic, eicosapentaenoic, alpha-linolenic, linoleic, oleic
or palmitic acid in BSA complex were extracted using HIP (hexane:iso-propanol 3:2,
per volume) buffer and dried under N₂ gas. Phospholipid species were measured by
a high performance liquid chromatography (HPLC) 1100 system (Agilent) coupled with a
Triple Quadrupole Ion Trap mass spectrometer (4000Qtrap, Applied Biosystem).
Levels of individual phospholipid species were analysed using MRM as previously
described and quantified using spiked internal standards that include
dimyristoyl PC, dimyristoyl PE, dimyristoyl PS (Avanti Polar Lipids, Alabaster, Alabama,
USA), and dioctanoyl PI (Echelon Biosciences, Salt Lake City, Utah, USA).

In vitro transport of radiolabelled and fluorescent LPCs. Radiolabelled LPC
palmitate (LPC [³H]palmitate), oleate (LPC [¹⁴C]oleate), and docosahexaenoate
(LPC [¹⁴C]DHA) or TopFluor LPC, TopFluor LPE, and NBD LPCs were dissolved
in 12% BSA, which was diluted in 150 mM NaCl. Uptake of radiolabelled
or fluorescent LPCs was assayed using HEK293 cells overexpressing Mfsd2a
and mutant constructs. In brief, HEK293 cells at 90-95% confluency were
transfected using Lipofectamine 2000 (Invitrogen) with pcDNA3.1Mfsd2a (wild-type),
pcDNA3.1Mfsd2aD92A (Asp92Ala), pcDNA3.1Mfsd2aD96A (Asp96Ala), or pcDNA3.1
(mock) plasmids. Uptake assays were performed 24 h after transfection. Prior
to ligand incubation, transfected HEK293 cells were washed with serum-free
DMEM. For concentration- and time-dependent assays, radiolabelled LPCs were
diluted in pre-warmed DMEM. For the sodium-dependence assay, radiolabelled
LPCs were diluted in transport buffer (5 mM KCl, 10 mM Hepes, pH 7.4) with 150 mM
NaCl or 150 mM choline chloride. For sodium concentration-dependent assays,
any reduction in the concentration of NaCl was compensated with choline chloride in
order to maintain a constant cation molarity of 150 mM. All assays were performed
in 12-well plates with triplicates and at 37 °C.
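
The constant-cation rule described above can be sketched as a small helper that pairs each NaCl concentration with the choline chloride needed to keep the total at 150 mM. The function names and the chosen sodium points are hypothetical; the text gives only the 150 mM total and the two end points.

```python
TOTAL_CATION_MM = 150.0  # constant NaCl + choline chloride molarity from the text


def choline_for_sodium(nacl_mm, total_mm=TOTAL_CATION_MM):
    """Choline chloride (mM) that replaces the NaCl removed from the buffer."""
    if not 0.0 <= nacl_mm <= total_mm:
        raise ValueError("NaCl must lie between 0 and the total cation molarity")
    return total_mm - nacl_mm


def buffer_series(nacl_points_mm):
    """Return (NaCl mM, choline chloride mM) pairs for a dose-response series."""
    return [(na, choline_for_sodium(na)) for na in nacl_points_mm]


# Hypothetical sodium points for a dose-response curve.
series = buffer_series([0, 10, 50, 150])
```

Substituting choline for sodium in this way changes only the sodium concentration, so any change in uptake can be attributed to sodium rather than to ionic strength.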

Albumin-free ligand transport assays. To prepare LPC palmitate dissolved in
ethanol, 0.75 µCi LPC [³H]palmitate was diluted in LPC palmitate in chloroform.
The mixture was dried and dissolved in 50 µl ethanol before adding 6 ml of transport
buffer with 150 mM sodium as described above to give 50 µM LPC palmitate. To
prepare LPC palmitate micelles, 0.75 µCi LPC [³H]palmitate was diluted in LPC
palmitate in chloroform. The mixture was dried and dissolved in 6 ml of transport
buffer with 150 mM sodium to give 100 µM LPC palmitate and sonicated on ice
for 5 min. Activated charcoal was added and spun to remove the monomers of
LPC palmitate. Transport assays were performed similarly with HEK293 cells
overexpressing pcDNA3.1Mfsd2a or pcDNA3.1 plasmid as control for 30 min at
37 °C.
Competition transport assay. In brief, HEK293 cells, 24 h after transfection
with pcDNA3.1Mfsd2a (wild-type), pcDNA3.1Mfsd2aD92A (Asp92Ala),
pcDNA3.1Mfsd2aD96A, or pcDNA3.1 (mock) plasmids, were washed once with serum-free
DMEM medium before addition of a mixture of 25 µM radiolabelled LPC palmitate
and 250 µM cold competitors, which were dissolved in 12% BSA. Total BSA
concentration was kept constant in samples with or without cold competitors. Assays
were performed at 37 °C for 30 min. Competition assays with other LPC analogs such
as foscholine detergents and PAF were performed under the same conditions but
for 15 min. The reduced reaction time was necessary in order to limit potential
negative effects of detergents and bioactive lipids on cell survival. All assays
were performed in 12-well plates with triplicates and at 37 °C.
In vivo transport of radiolabelled, fluorescent LPCs and unesterified DHA.
Male and female mice aged 6-8 weeks were intravenously injected with 75 µl
of 20 mM radiolabelled LPC [¹⁴C]oleate, 100 µl of 10 mM LPC [¹⁴C]DHA-BSA
complex or 82 µl of 12.2 mM [¹⁴C]DHA-BSA complex in a total volume of 150 µl
of PBS. At 2 h post injection, mice were anaesthetized and perfused for 5 min with
PBS containing 0.5% BSA in order to remove blood and lipid tracer bound to the
brain vasculature. Tissues were collected for lipid extraction. For lipid extraction,
tissues were weighed and similar amounts of tissues were homogenized in
chloroform:methanol. Lipids from the organic phase were mixed with scintillant and
scintillation counted. Similar experiments were also performed with an injection of
300 µg NBD LPC-BSA complex or 300 µg TopFluor LPC-BSA complex per mouse. In
this experiment, mice were perfused for 5 min with PBS followed by a 15 min perfusion
with 4% paraformaldehyde (PFA) for tissue fixation. Brain sections of wild-type
and knockout mice with a thickness of 40 µm were prepared and scanned using green
fluorescence mode in a Typhoon FLA 9000 scanner (Agilent). NBD LPC accumulation
was expressed as the fluorescence of each section per its area. Dietary DHA transport
from mother to fetus was also measured using [¹⁴C]DHA-BSA complex. Pregnant
females at E18.5 were gavaged with a bolus of [¹⁴C]DHA-BSA complex (200 µl of 10 mM
[¹⁴C]DHA-BSA per mouse). Brains of fetuses were collected 20 h post gavage and
weighed. Lipid extraction and radioactivity measurement were performed as described above.
TLC analysis of phospholipids. HEK293 cells overexpressing pcDNA3.1Mfsd2a
(wild-type), pcDNA3.1Mfsd2aD92A (Asp92Ala), pcDNA3.1Mfsd2aD96A, or pcDNA3.1
(mock) plasmids or human Mfsd2a (in Sport6 plasmid) were washed once with
serum-free DMEM medium before incubation with 50 µM radiolabelled LPC
[¹⁴C]oleate, LPC [¹⁴C]DHA or with 50 µM TopFluor LPC and NBD LPC. Lipids were
extracted twice with HIP buffer for 30 min. Lipids were dried under a nitrogen stream,
reconstituted in chloroform and spotted on TLC plates (Millipore). The solvent for
phospholipid separation was a solution of chloroform, methanol and ammonia
(25%) (50:25:6 per volume). Thin-layer chromatography (TLC) plates of
radiolabelled phospholipids were dried for 30 min, exposed overnight to
Phosphorscreens and scanned with a Typhoon FLA 9000 scanner (Agilent). TLC plates
of fluorescent phospholipids were scanned with the Typhoon FLA 9000 scanner and
quantified using ImageQuant software.
Histological studies. Adult male wild-type and knockout mice aged 7.5-8 weeks,
born to HET parents, were deeply anaesthetized and perfused transcardially
with 50 ml of saline, followed by 100 ml of 4% PFA in 0.1 M PBS (pH 7.4) for 30
min. For embryos, brains were fixed in 4% PFA then 30% sucrose, both overnight.
Sagittal sections of 40 µm thickness were cut in a cryostat, and serial sections
were transferred to different wells of a 24-well tissue-culture dish for
immunostaining. Sections of embryos were embedded on slides. The brain sections
were processed for immunocytochemistry using antibodies against NeuN (1:1000,
Chemicon, California, USA), Mfsd2a (1:500), GFAP (1:1000, Chemicon, California, USA),
parvalbumin (1:500, Swant, Switzerland), Glut1 (1:500, Abcam), PDGFRβ (1:150,
eBioscience) and incubated for 5 min in 4′,6-diamidino-2-phenylindole (DAPI,
1:5000) before washing and mounting. The images were acquired on a Zeiss LSM
710 inverted fluorescence confocal microscope (Carl Zeiss, Singapore). The same
localization procedures were performed on hippocampal sections from Macaca
fascicularis. NeuN-immunostained neuronal profiles in the hippocampus and
parvalbumin-stained Purkinje cells in cerebellum were counted and expressed as
the number of immunopositive neurons per square millimetre (number per mm²)
and the number of immunopositive cells per millimetre (number per mm),
as mean ± s.e.m. Statistical significance was assessed using the Student’s t-test.
Behaviour studies. For the Y-maze spontaneous-alternation test, mice were placed
in the centre of a Y-shaped maze with three arms at a 120° angle from each other.
The animal was allowed to freely explore all three arms. A mouse with intact
spatial working memory has a reduced tendency to re-enter the most recently
visited arm. The Topscan program (Cleversys, Reston, Virginia) was used to score
the number of arm entries, and determine the number of alternations. A mouse was
considered as having entered an arm when all four limbs were within the arm.
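
As a sketch of how alternations can be scored from a sequence of arm entries, the snippet below counts overlapping triplets of consecutive entries that cover all three arms. This is a common convention for spontaneous-alternation scoring and is an assumption here, as is the percentage normalisation by (entries − 2); the text does not state the exact rule used by the scoring software. The function names and the example sequence are hypothetical.

```python
def count_alternations(entries):
    """Count overlapping triplets of consecutive entries with three distinct arms."""
    return sum(
        1
        for i in range(len(entries) - 2)
        if len(set(entries[i:i + 3])) == 3
    )


def alternation_percent(entries):
    """Alternations as a percentage of the possible triplets (entries - 2)."""
    possible = len(entries) - 2
    if possible <= 0:
        return 0.0
    return 100.0 * count_alternations(entries) / possible


# Hypothetical sequence of arm entries (arms labelled A, B and C).
arm_entries = list("ABCACBABC")
n_alternations = count_alternations(arm_entries)
percent = alternation_percent(arm_entries)
```

In this example five of the seven possible triplets are alternations; the triplets CAC and BAB each revisit an arm and are not counted.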
The novel-object recognition test was carried out as described previously.
In brief, mice were trained with identical ‘familiar’
objects for 5 min and then assessed for short-term memory (STM) and long-term
memory (LTM), which took place 20 min and 24 h after training, respectively. The
bouts of exploration for each object were scored using the Annostar program (Cleversys,
Reston, Virginia). Preference score was calculated as (time spent with novel object
− time spent with familiar object) ÷ (total time spent with both objects).
Preferences for the familiar object and the novel object were defined as negative
and positive scores, respectively, and scores approaching zero showed no
preference for either object.
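
The preference score defined above can be written as a one-line calculation. The sketch below is a direct transcription of that definition; the function name and the example times are hypothetical.

```python
def preference_score(novel_s, familiar_s):
    """(novel - familiar) / (novel + familiar); ranges from -1 to +1."""
    total = novel_s + familiar_s
    if total <= 0:
        raise ValueError("no exploration time recorded")
    return (novel_s - familiar_s) / total


# Hypothetical exploration times in seconds.
prefers_novel = preference_score(30.0, 10.0)
prefers_familiar = preference_score(10.0, 30.0)
no_preference = preference_score(20.0, 20.0)
```

Dividing by the total exploration time normalises the score, so animals that explore more overall are not scored as having a stronger preference.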

For anxiety tests, all behavioural apparatus were cleaned between each animal 
with surface disinfectant and 70% ethanol. In the zero maze, mice were placed in the 
closed arm and allowed 10 min for exploration. Behaviours, such as the time spent 
in, number of entries into, number of transitions between and latency to enter the 
open arms, were scored using the Annostar behaviour scoring program (Cleversys, 
Reston, Virginia, USA). In addition, exploratory behaviours, such as head dips and 
stretch attends, were recorded. At the start of the light-dark box test, mice were 
placed into the dark box measuring 20 × 40 × 16 cm, and allowed to move freely
between the dark and light boxes for 10 min. Behavioural measures, such as time 
spent and horizontal activity in the light box, latency to enter the light box and the 
number of transitions between the two boxes, were recorded using the Versamax 
program (AccuScan Instruments, Columbus, Ohio, USA). 


For open-field activity, mice were placed into the chamber for 60 min and the
total distance travelled, number of rears, and time spent in the corners and centre
were recorded using the Versamax program (AccuScan Instruments, Columbus, 
Ohio, USA). 
Statistical analysis. Statistical differences of DHA and AA levels, and histological 
analyses between genotypes were calculated using an unpaired Student’s t-test. 
Statistical analysis of LPC [¹⁴C]DHA, LPC [³H]palmitate and LPC [¹⁴C]oleate signals
expressed as DPM between mock versus wild-type and mutants were calculated 
using two-way ANOVA; a P < 0.05 was considered to be significant. 

For the behavioural tests, genotype was the between-subjects factor. A one-way
ANOVA was used to analyse the zero maze, light-dark box, open-field and Y-maze
tests. A two-way ANOVA was used to analyse the novel-object test, with test day as
the between-subjects factor. Bonferroni-corrected pairwise comparisons were used as
post-hoc tests. Data were expressed as mean ± s.e.m. and P < 0.05 was considered
statistically significant.
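
The summary statistics named above can be sketched with the Python standard library. The example computes mean ± s.e.m., the pooled-variance Student's t statistic for two genotypes, and a Bonferroni-adjusted significance threshold. The data values and function names are hypothetical; converting t to a P value needs a t-distribution from a statistics package and is omitted here.

```python
import math
import statistics


def mean_sem(values):
    """Return (mean, standard error of the mean) for a sample."""
    return statistics.mean(values), statistics.stdev(values) / math.sqrt(len(values))


def student_t(a, b):
    """Unpaired Student's t statistic (pooled variance) and degrees of freedom."""
    na, nb = len(a), len(b)
    pooled = ((na - 1) * statistics.variance(a)
              + (nb - 1) * statistics.variance(b)) / (na + nb - 2)
    t = (statistics.mean(a) - statistics.mean(b)) / math.sqrt(pooled * (1 / na + 1 / nb))
    return t, na + nb - 2


def bonferroni_alpha(alpha, n_comparisons):
    """Per-comparison threshold after Bonferroni correction."""
    return alpha / n_comparisons


# Hypothetical measurements for two genotypes.
wild_type = [10.0, 12.0, 11.0, 13.0]
knockout = [6.0, 7.0, 5.0, 6.0]

print(mean_sem(wild_type))
print(student_t(wild_type, knockout))
print(bonferroni_alpha(0.05, 3))
```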


19. Wong, P. et al. Pregnenolone rescues schizophrenia-like behavior in dopamine 
transporter knockout mice. PLoS ONE 7, e51455 (2012). 

Extended Data Figure 1 | Mfsd2a is highly expressed in endothelium of 
micro-vessels in brain. a, Expression of Mfsd2a in endothelium is co-localized 
with glucose transporter Slc2a1 (Glut1). Arrowheads show endothelial cells in 
blood brain vessels. b, Mfsd2a is highly expressed in micro-vessels in brain, 
shown here are sections in dentate gyrus regions. c, Mfsd2a and the pericyte 
marker PDGFRβ co-localize in brain microvasculature. d, Mfsd2a is not
expressed in pericytes. Yellow and red arrowheads indicate endothelial cells and
a pericyte, respectively. Similar expression pattern of Mfsd2a is found in
endothelium of micro-vessels in brain of monkey. e, Mfsd2a is highly expressed 
in micro-vessels and is co-localized with glucose transporter Slc2a1 (Glut1) in 
brain, shown here are sections in cerebellum of P4 monkey. f, g, Expression 
of Mfsd2a in endothelium of brain micro-vessels. Shown is the hippocampal
region. The marker for astrocytes is GFAP. KO, knockout; WT, wild type. 

Extended Data Figure 2 | Localization of Mfsd2a at the blood-brain barrier
of E15.5 fetus and lipid analysis. a, Mfsd2a (red) is highly expressed in
micro-vessels and is co-localized with glucose transporter Slc2a1 (Glut1, green)
in fetal brain. Scale bars, 100 μm. b, Placental and fetal weights. Placentas and
fetuses from two HET pregnant mice (E18.5) crossed with a Mfsd2a-knockout
male were collected and weighed. There were no significant differences in
placental and fetal weight between HET (n = 7) and knockout (n = 11) mice.
c, Tail suspension was used to test for the presence of the paw-clasping
phenotype of 10-week-old wild-type and knockout mice. d, Mass spectrometry
measurement of phospholipids in the E18.5 fetal brain of wild-type (n = 6)
and knockout (n = 5) mice showed that fetal brains of knockout mice had
significantly reduced DHA levels, whereas AA levels were increased.
***P < 0.001 (see Source Data accompanying this figure for the full data set).

Extended Data Figure 3 | Mfsd2a-knockout mice exhibit microcephaly.
a, A representative image of brains of two 8-week-old wild-type and knockout
littermates. b, The brain weight of knockout (n = 4) mice is significantly lower
than that of wild-type (n = 4) littermates. c, Gross morphology of brains
and sagittal sections of brains. Sagittal brain sections of 8-week-old wild-type
and knockout mice were stained with NeuN to visualize neuronal cells and
Mfsd2a polyclonal antibody to visualize expression of Mfsd2a. Mfsd2a is shown
to be widely expressed in brain. Scale bars, 1 mm. d, Haematoxylin and
eosin (H&E) staining of hippocampus region of 8-week-old wild-type and
knockout mice, indicating a smaller hippocampus in knockout mice. Scale bars,
500 μm. ***P < 0.001. Data are expressed as mean ± s.e.m.

Extended Data Figure 4 | Mfsd2a-knockout mice exhibit deficits in learning
and memory, and severe anxiety. a-d, The Y-maze test (a, b) and
novel-object recognition test (c, d) were used to assess spatial learning,
short-term memory (STM) and long-term memory (LTM) of the wild-type and
knockout mice, respectively. Knockout mice exhibited significantly decreased
total arm entries in a Y-maze test for spatial working memory. Knockout mice
showed significantly reduced preferences for novel objects in novel-object
recognition tests, indicative of defects in short-term memory and long-term
memory, respectively. Train, the training period. e-l, Zero-maze tests (e-h) and
light-dark box tests (i-l) were used to assess anxiety of the wild-type and
knockout mice, respectively. Knockout mice showed decreased transitions and
head dips into open arms during Zero-maze test for anxiety behaviours.
Knockout mice showed decreased entry into light box and increased latency to
enter light box during light-dark box tests for anxiety. m-o, Open-field test for
activity. Knockout mice showed reduced travel distance in the open-field
test for locomotor activity. During the open-field test, knockout mice had no
vertical activity, indicative of motoric dysfunction, and decreased time spent in
the centre, indicative of reduced exploration compared to wild-type mice.
The increased time spent in the corners of the open field suggests that knockout
mice were more anxious than wild-type mice, and is congruent with
our results from the Zero-maze and light-dark box tests. Wild-type mice
(n = 11-13) and knockout mice (n = 8-10). ***P < 0.001, **P < 0.01,
*P < 0.05. Data are expressed as mean ± s.e.m.

Extended Data Figure 5 | Human and mouse Mfsd2a transport LPCs but
not unesterified fatty acids. a, Thin-layer chromatography (TLC) analysis of
phospholipids and neutral lipids of HEK293 cells transfected with mouse
Mfsd2a and human Mfsd2a after overnight incubation with 100 μM
[¹⁴C]DHA. Std, unesterified [¹⁴C]DHA. Experiments were repeated three times
with triplicates. b, TLC analysis of phospholipids and neutral lipids of HEK293
cells transfected with mouse Mfsd2a and mutants after overnight incubation
with 100 μM [¹⁴C]oleate. The TLC protocol used was described in the Methods.
CE, cholesteryl ester; PC, phosphatidylcholine; PE, phosphatidylethanolamine;
TAG, triglyceride. c, Localization of Mfsd2a, D92A and D96A at the plasma
membrane (red). d, Western blot analysis of expression of Mfsd2a, D92A and
D96A in HEK293 cells 24 h post transfection. e, Biological incorporation
of radiolabelled LPC [¹⁴C]DHA into PC. f, Biological incorporation of
radiolabelled LPC [¹⁴C]oleate into PC. Cells expressing human (e, f) or mouse
(f) Mfsd2a were incubated with LPC [¹⁴C]DHA or 50 μM LPC [¹⁴C]oleate.
Lipids were extracted from cells after 30 min incubation with LPC [¹⁴C]DHA
and 120 min incubation with LPC [¹⁴C]oleate and analysed using a TLC method for
resolving PC and LPC. Experiments were repeated two times with triplicate.
g, h, Dose-dependent transport of LPC [¹⁴C]DHA (g) and LPC [¹⁴C]oleate
(h) by human Mfsd2a (hMfsd2a) and empty plasmid (mock) expressing
HEK293. These experiments were performed in triplicate. i, Time-dependent
transport of 50 μM LPC [¹⁴C]oleate. Mouse Mfsd2a (wild-type) and mutant
constructs D92A and D96A were tested for uptake of radiolabelled LPCs at
indicated times. This experiment was performed in triplicate. j, Increased net
uptake of LPC ligand in cells expressing mouse Mfsd2a. TLC analysis of
phospholipids of HEK293 cells transfected with mouse Mfsd2a and mutants
after 1 h post incubation with 100 μM unlabelled LPC oleate. Shown numbers
are fold changes of PC levels relative to mock. Experiments were repeated three
times with duplicates. k, Transport activity of mouse Mfsd2a (wild-type),
D92A, D96A, and mock expressing cells was not significantly different at
indicated pHs. This experiment was performed in triplicate. l, Activity of
Mfsd2a is sodium- but not lithium-dependent. Data were expressed as fold
change of Mfsd2a expressing cells compared to corresponding mock cells
treated with the same conditions. This experiment was performed in triplicate.
m, Transport activity of Mfsd2a is not BSA-dependent as LPC palmitate
solubilized in either ethanol or micellular form was transported by Mfsd2a,
albeit to a lower level than with BSA. This experiment was performed in
triplicate. Data are expressed as mean ± s.e.m.

Extended Data Figure 6 | Competition assay to determine the ligand
structures of Mfsd2a. All competition assays were performed using 25 μM
LPC [³H]palmitate as ligand with or without tenfold molar excess (250 μM) of
the indicated competitors. a, The structures of the lipid competitors used in
b and c. b, Competition assays with indicated acyl chain LPCs. c, Competition
assay with indicated headgroups. Assays were stopped after 30 min of
incubation. Competitive activity was expressed as a percentage of control (activity of
Mfsd2a without competitor). 6:0, hexanoate; 8:0, octanoate; 10:0, decanoate;
12:0, laurate; 14:0, myristate; 16:0, palmitate; 18:0, stearate; 18:1, oleate;
GPC, alpha-glycerylphosphocholine; LPA, lysophosphatidic acid; LPC,
lysophosphatidylcholine; LPE, lysophosphatidylethanolamine; LPS,
lysophosphatidylserine. d, Representative structures of bioactive lipid
competitors used in e and f. e, Competition assay with lysophospholipid forms
of plasmalogens and platelet activating factor (PAF). This experiment was
performed together with b, so that the control and mock shown in b can be used
for reference. f, PAF and lysosphingomyelin (lysoSM) also showed strong
competition, whereas sphingosine 1-phosphate (S1P) did not compete for LPC
[³H]16:0 uptake. Competitive activity was expressed as a percentage of control
(activity of Mfsd2a without competitor). g, Representative structure of
non-biological lysophospholipid analogues foscholine-16 (Miltefosine) with an
alkyl chain of 16 carbons. h, Competition assays of indicated foscholines with
LPC [³H]palmitate. Assays were stopped after 15 min of incubation.
Competitive activity was expressed as a percentage of control (activity of Mfsd2a
without competitor). Foscholine with alkyl chain length of 8 (Fos-8),
10 (Fos-10) and 12 (Fos-12) carbons did not compete, whereas foscholine with
an alkyl chain length of 16 carbons (Miltefosine) showed strong competition
with LPC [³H]16:0. These experiments were repeated two times with triplicates.
***P < 0.001. Data are expressed as mean ± s.e.m.

Extended Data Figure 7 | Mfsd2a transports TopFluor LPE. Thin-layer
chromatography (TLC) analysis of phospholipids of HEK293 cells transfected
with mouse Mfsd2a and mutants after 30 min post incubation with 25 μM
TopFluor LPE. a, TLC analysis of phospholipids. b, Quantification of intensity
of PE band from TLC plate. PE, phosphatidylethanolamine; LPE,
lysophosphatidylethanolamine. Experiments were repeated two times with
duplicates. Data are expressed as mean ± s.e.m.

Extended Data Figure 8 | Brain uptake of unesterified [¹⁴C]DHA was not
reduced in Mfsd2a-knockout mice. Male mice aged 7 weeks old were
intravenously injected with 1 mmol of [¹⁴C]DHA-BSA complex. Brain, liver
and heart were collected 2 h post injection for lipid extraction and DPM
quantified using scintillation counting. a-c, Uptake of unesterified [¹⁴C]DHA
in the wild-type and knockout brain, heart and liver were expressed as DPM
per g. d-f, Level of uptake of DHA in a-c was converted into nmol per g.
g-i, A comparison between the absolute amount of DHA uptake in the form of
LPC DHA (converted from Fig. 4 into nmol per g in 2 h) and unesterified DHA
(taken from d-f above) in brain, heart and liver of wild-type mice. The same
amounts of LPC DHA and DHA were injected in mice. The amount of LPC
DHA uptake was far greater than unesterified DHA uptake by wild-type brain.
***P < 0.0001, *P < 0.05. Data are expressed as mean ± s.e.m. (wild-type,
n = 5; knockout, n = 5).

Extended Data Figure 9 | Brain uptake of TopFluor LPC was reduced in
Mfsd2a-knockout mice. This experiment was carried out as described for
NBD LPC in Fig. 4. a, Structure of TopFluor LPC. b, HEK293 cells expressing
wild-type Mfsd2a showed significantly enhanced uptake activity to TopFluor
LPC compared with mock (empty plasmid), D92A and D96A mutant
expressing cells. c, TLC analysis showed that TopFluor LPC was
bio-incorporated into PE. Experiments were repeated two times with
duplicates. d, Quantification of PE band from the TLC plate shown in c.
e, Brain uptake of TopFluor LPC was decreased in knockout mice. Male mice
(wild-type, n = 3; knockout, n = 3) aged 7 weeks old were intravenously
injected with 300 μg TopFluor LPC-BSA complex. f, Fluorescence from 10
brain sections of wild-type and knockout mice was quantified and expressed as
fluorescence intensity per pixel. ***P < 0.001. Data are expressed as
mean ± s.e.m.

Extended Data Figure 10 | Dietary DHA supplementation failed to rescue
Mfsd2a-knockout phenotypes. Heterozygous female mice were gavaged
with 100 μl DHA oil (containing 26% DHA triglyceride and 6% EPA, total
omega-3 is 35%) every 2 days for 2 weeks before conception in crosses with
heterozygous males. During gestation, pregnant mice were continued on
gavages of DHA every 2 days. Gavages of mothers continued during
breastfeeding and pups were weaned onto normal diet at 3 weeks of age and
gavaged every 2 days with DHA for 8 weeks. a, Brain weight of adult wild-type
(n = 4) and knockout (n = 4) mice aged 8 weeks after treatment with dietary
DHA oil. Knockout mice brains were still significantly smaller. b-e, DHA
treatment of knockout mice did not reduce the strong anxiety phenotype as
determined using the light-dark box test. To investigate why dietary DHA
failed to rescue knockout phenotypes, we tested the hypothesis that uptake of
maternally derived DHA (in this case the DHA delivered to the mother
via gavage) might not get into the brain of knockout mice during brain
development. To test this possibility, pregnant HET mothers intercrossed with
knockout fathers were gavaged at E17.5-E19.5 with [¹⁴C]DHA and uptake
into fetal brains was quantified. Note that heterozygous mice do not exhibit
detectable phenotypes, have similar DHA to wild type (not shown) and are thus
similar to wild type. The rationale for using HET and knockout intercrosses
was to increase the yield of knockout mice in this study. The data shown in
f indicate that brains of knockout mice exhibited an 80% reduction in the
uptake of [¹⁴C]DHA relative to HET mice with the same mothers (n = 4
wild-type, n = 6 knockout). Therefore, Mfsd2a expressed during fetal
development is important for DHA transport into brain. *P < 0.05,
***P < 0.001. Data are expressed as mean ± s.e.m.

Mfsd2a is critical for the formation and function of the blood-brain barrier

Ayal Ben-Zvi, Baptiste Lacoste, Esther Kur, Benjamin J. Andreone, Yoav Mayshar, Han Yan & Chenghua Gu


The central nervous system (CNS) requires a tightly controlled envi- 
ronment free of toxins and pathogens to provide the proper chemical 
composition for neural function. This environment is maintained by 
the ‘blood-brain barrier’ (BBB), which is composed of blood vessels 
whose endothelial cells display specialized tight junctions and extre- 
mely low rates of transcellular vesicular transport (transcytosis)'~*. 
In concert with pericytes and astrocytes, this unique brain endothe- 
lial physiological barrier seals the CNS and controls substance influx 
and efflux**. Although BBB breakdown has recently been associated 
with initiation and perpetuation of various neurological disorders, 
an intact BBB is a major obstacle for drug delivery to the CNS”""°. A 
limited understanding of the molecular mechanisms that control BBB 
formation has hindered our ability to manipulate the BBB in disease 
and therapy. Here we identify mechanisms governing the establish-
ment of a functional BBB. First, using a novel tracer-injection method
for embryos, we demonstrate spatiotemporal developmental profiles 
of BBB functionality and find that the mouse BBB becomes func- 
tional at embryonic day 15.5 (E15.5). We then screen for BBB-specific 
genes expressed during BBB formation, and find that major facil- 
itator super family domain containing 2a (Mfsd2a) is selectively ex- 
pressed in BBB-containing blood vessels in the CNS. Genetic ablation 
of Mfsd2a results in a leaky BBB from embryonic stages through to 
adulthood, but the normal patterning of vascular networks is main- 
tained. Electron microscopy examination reveals a dramatic increase 
in CNS-endothelial-cell vesicular transcytosis in Mfsd2a−/− mice,
without obvious tight-junction defects. Finally we show that Mfsd2a 
endothelial expression is regulated by pericytes to facilitate BBB integ- 
rity. These findings identify Mfsd2a as a key regulator of BBB function 
that may act by suppressing transcytosis in CNS endothelial cells. 
Furthermore, our findings may aid in efforts to develop therapeutic 
approaches for CNS drug delivery. 

Two unique features of the CNS endothelium determine BBB integ- 
rity (Extended Data Fig. 1). One is specialized tight junctions between 
a single endothelial cell layer lining the CNS capillaries, which form the 
physical seal between the blood and brain parenchyma’. In addition, CNS 
endothelial cells have lower rates of transcytosis than endothelial cells in 
other organs’. Peripheral endothelial cells display active vesicle traffick- 
ing to deliver nutrients to peripheral tissues, whereas CNS endothelial 
cells express transporters to selectively traffic nutrients across the BBB'*"". 
However, it is not clear when and how these properties are acquired. 
Furthermore, the molecular mechanisms that give rise to the unique 
properties of the CNS endothelium have not been identified. Although 
recent studies revealed molecular pathways involved in the development 
of the embryonic BBB’*”’, disruption of some of these genes affects vascular
network development, making it difficult to determine whether
barrier defects are primary or secondary to a broader vascular effect. 

We aimed to first identify the developmental time-point when the BBB
gains functional integrity, and then use that time window to profile BBB- 
specific genes when the BBB is actively forming, to maximize the chance 
of identifying key regulators. The prevailing view has been that the embryonic
and perinatal BBB are not yet functional’. However, previous
embryonic BBB functionality studies were primarily performed by transcardiac
tracer perfusion, which may dramatically affect blood pressure,
cause bursting of CNS capillaries, and artificially produce leakiness phe- 
notypes’”°. To circumvent these obstacles, we developed a method to 
assess BBB integrity during mouse development, in which a small vol- 
ume of tracer is injected into embryonic liver to minimize changes in 
blood pressure (Fig. 1a, see Supplementary Information for the method). 

Using this method, we identified the timing of BBB formation in the 
developing mouse brain and observed a spatial and temporal pattern of 
‘functional-barrier genesis’ (Fig. 1b). We found that in E13.5 cortex a 
10-kDa dextran tracer leaked out of capillaries and was taken up by non- 
vascular brain parenchyma cells (Fig. 1b, top panel). At E14.5, the tracer 
was primarily restricted to capillaries, but tracer was still detected out- 
side vessels (Fig. 1b, middle panel). In contrast, at E15.5, the tracer was 
confined to vessels with no detectable signal in the surrounding brain 
parenchyma, similar to the mature BBB (Fig. 1b, bottom panel). The 
development of BBB functionality differed across brain regions (Sup- 
plementary Information and Extended Data Fig. 2). These data dem- 
onstrate that following vessel ingression into the neural tube, the BBB 
gradually becomes functional as early as E15.5. 

Based on the temporal profile of BBB formation, we compared ex- 
pression profiles of BBB (cortex) and non-BBB (lung) endothelium at 
E13.5, using an Affymetrix array (Supplementary Information), and iden- 
tified transcripts with significantly higher representation in cortical than 
lung endothelium (Fig. 2). These transcripts included transporters, tran- 
scription factors, and secreted and transmembrane proteins (Fig. 2c). 
We were particularly interested in transmembrane proteins, owing to 
their potential involvement in cell-cell interactions that regulate BBB 
formation. 
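
The enrichment step above can be sketched as a simple filter. The example assumes per-probe average expression values for cortex and lung endothelium and uses the thresholds stated in the Fig. 2 legend (cortex expression above 500 a.u. and at least fivefold higher in cortex). The probe names, values and function name are hypothetical, and the statistical test behind "significantly higher" is not reproduced.

```python
def enriched_probes(expression, min_cortex=500.0, min_fold=5.0):
    """Return (probe, fold) pairs enriched in cortex over lung.

    `expression` maps a probe name to (cortex, lung) average expression
    in arbitrary units. Probes are kept when cortex expression exceeds
    `min_cortex` and the cortex/lung ratio is at least `min_fold`.
    """
    hits = []
    for probe, (cortex, lung) in expression.items():
        if cortex <= min_cortex or lung <= 0:
            continue  # below the display threshold or no usable lung signal
        fold = cortex / lung
        if fold >= min_fold:
            hits.append((probe, fold))
    # Strongest enrichment first.
    return sorted(hits, key=lambda item: item[1], reverse=True)


# Hypothetical expression values (cortex, lung) in arbitrary units.
data = {
    "probe_a": (7880.0, 100.0),
    "probe_b": (600.0, 400.0),
    "probe_c": (300.0, 10.0),
    "probe_d": (2000.0, 250.0),
}
for probe, fold in enriched_probes(data):
    print(probe, round(fold, 1))
```

Applying the absolute cortex threshold before the ratio avoids large fold changes that arise only from dividing two near-background values.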

One of the genes identified, Mfsd2a, had 78.8 times higher express- 
ion in cortical endothelium than in lung endothelium (Fig. 3a). In situ 
hybridization showed prominent Mfsd2a mRNA expression in CNS vas- 
culature but no detectable signal in vasculature outside the CNS, such 
as in lung or liver (Fig. 3b). Moreover, both Mfsd2a mRNA and Mfsd2a 
protein were absent in the choroid plexus vasculature, which is part of 
the CNS but does not possess a BBB’ (Fig. 3c, d, g). Mfsd2a expression 
in CNS vasculature was observed at embryonic stages (E15.5), postnatal 
days 2 and 5 (P2 and P5) and in adults (P90) (Fig. 3b-e and Extended 
Data Fig. 3). Finally, Mfsd2a protein, which is absent in the Mfsd2a−/−
mice (Fig. 3e)*’, was specifically expressed in claudin-5-positive CNS
endothelial cells but not in neighbouring parenchyma cells (neurons or
glia) or adjacent Pdgfrβ-positive pericytes (Fig. 3f). Previously, Mfsd2a
was reported to be a transmembrane protein expressed in the placenta 
and testis, which have highly restrictive barrier properties’. Together 
with our demonstration of Mfsd2a-specific expression in BBB-containing 
endothelial cells, this suggests that Mfsd2a may have a role in BBB for- 
mation and/or function. 

To test this hypothesis, we examined BBB integrity in Mfsd2a−/− mice.
Using our embryonic injection method, 10-kDa dextran was injected
into Mfsd2a−/− and wild-type littermates at E15.5. As expected, dextran
was confined within vessels of control embryos. In contrast, dextran 
leaked outside the vessels in Mfsd2a−/− embryonic brains and was found
in the cortical parenchyma (Fig. 4a) and individual parenchyma cells
(quantified as tracer-positive parenchyma cells per unit area of the de-
veloping lateral cortical plate; Fig. 4b). Furthermore, using imaging and
spectrophotometric quantification methods’, we found that the leaky
phenotype persisted in early postnatal (Extended Data Fig. 4) and adult
(Fig. 4c) Mfsd2a−/− mice. Because the sequence of Mfsd2a has simi-
larities to the major facilitator superfamily of transporters, and Mfsd2a
facilitates the transport of tunicamycin in cancer cell lines”’, we injected
two non-carbohydrate-based tracers of different sizes to rule out the pos-
sibility that dextran leakiness is due to interactions with Mfsd2a. Sulfo-
NHS-biotin (~550 Da) and horseradish peroxidase (HRP; ~44 kDa)
tracers exhibited the leaky phenotype in Mfsd2a−/− mice (Extended Data
Fig. 4a, b). Moreover, a larger molecular weight tracer, 70-kDa dextran,
also displayed leakiness in Mfsd2a−/− mice (Extended Data Fig. 4d).
In contrast to severe barrier leakage defects (Fig. 4a-c and Extended
Data Fig. 4), brain vascular patterning was similar between Mfsd2a−/−
mice and littermate controls. No abnormalities were identified in capil-
lary density, capillary diameter or vascular branching (Fig. 4d and Ex-
tended Data Fig. 5a), in embryonic (E15.5), postnatal (P4), and adult
(P70) brains of Mfsd2a−/− mice. Moreover, we found no abnormalities
in cortical arterial distribution in adult Mfsd2a−/− mice (Extended Data

Figure 1 | A novel tracer-injection method reveals a temporal profile of
functional BBB formation in the embryonic cortex. a, In utero embryonic 
liver tracer injection method; fenestrated liver vasculature enabled rapid tracer 
uptake into the embryonic circulation. b, Dextran-tracer injection revealed a 
temporal profile of functional cortical BBB formation. Representative images of 
dorsal cortical plates from injected embryos after capillary labelling with lectin 
(green, lectin; red, 10-kDa tracer). Top panel (E13.5), tracer leaked out of 
capillaries and was subsequently taken up by non-vascular parenchyma cells 
(arrowheads), with little tracer left inside capillaries (arrow). Middle panel 
(E14.5), tracer was primarily restricted to capillaries (arrow), with diffused 
tracer detectable in the parenchyma (arrowheads). Bottom panel (E15.5), tracer 
was confined to capillaries (arrow). n = 6 embryos (3 litters per age). 

Figure 2 | Expression profiling identifies genes involved in BBB formation.
a, Dot-plot representation of Affymetrix GeneChip data showing 
transcriptional profile of cortical (BBB) and lung (non-BBB) endothelial cells 
isolated at the critical barrier-genesis period (E13.5). Dots reflect average 
expression of a probe in the cortex (x axis) and lung (y axis). Cortex expression 
values above 500 arbitrary expression units (a.u.) are presented. Red dots 
indicate a fivefold higher expression in the cortex. Mfsd2a value is circled in 
blue. b, Pan-endothelial markers were highly represented, whereas pericyte, 
astrocyte, and neuronal markers were detected at extremely low levels in 
both cortex and lung samples. c, Barrier-genesis specific transporters, 
transcription factors, and secreted and transmembrane proteins were 
significantly enriched in the cortical endothelial cells. All data are
mean ± s.d. n = 4 litters (4 biological replicates).


Fig. 5b). Therefore, Mfsd2a is specifically required for proper forma-
tion of a functional BBB but not for CNS vascular morphogenesis in vivo.
This result, together with the temporal difference between cortical vas- 
cular ingression (E10-E11) and cortical barrier-genesis (E13.5-E15.5), 
demonstrates that vascular morphogenesis and barrier genesis are dis- 
tinct processes. 

We next addressed whether Mfsd2a regulates endothelial tight-junction 
formation, transcytosis, or both. We examined these properties by elec- 
tron microscopy in embryonic brains and P90 mice following intrave- 
nous HRP injection’. Electron microscopy failed to reveal any apparent 
abnormalities in the ultrastructure of endothelial tight junctions (Fig. 5a). 
At E17.5, tight junctions in control and Mfsd2a−/− littermates appeared
normal, with electron-dense linear structures showing ‘kissing points’
where adjacent membranes are tightly apposed (Fig. 5a). In electron
micrographs of cerebral cortex in HRP-injected adults, peroxidase activity
was revealed by an electron-dense reaction product that filled the vessel
lumen. In both control and Mfsd2a−/− mice, HRP penetrated the inter-
cellular spaces between neighbouring endothelial cells only for short
distances. HRP was stopped at the tight junction, creating a boundary
between HRP-positive and HRP-negative regions without leakage through
tight junctions (Fig. 5a). In contrast, CNS endothelium of Mfsd2a−/−
mice displayed a dramatic increase in the number of vesicles, including
luminal and abluminal plasma membrane-connected vesicles and free
cytoplasmic vesicles, which may indicate an increased rate of transcy-
tosis (Fig. 5b). Specifically, pinocytotic events were evidenced by type II
lumen-connected vesicles pinching from the luminal plasma membrane.
Greater than twofold increases in vesicle number in Mfsd2a−/− mice
compared to control littermates were observed in different locations
along the transcytotic pathway (Fig. 5 and Extended Data Table 2). Fur-
thermore, the HRP reaction product in adult mice was observed in vesicles
invaginated from the luminal membrane and exocytosed at the ablum-
inal plasma membrane only in Mfsd2a−/− mice (Fig. 5d), suggesting
that HRP was subject to transcytosis in these animals but not in wild- 
type littermates (Extended Data Table 2). Together, these findings suggest 

Figure 4 | Mfsd2a is required for the establishment of a functional BBB but
not for CNS vascular patterning in vivo. a, b, Dextran-tracer (10 kDa)
injections at E15.5 revealed a defective BBB in mice lacking Mfsd2a. a, The
tracer was confined to the capillaries (arrow) in wild-type littermates, whereas
Mfsd2a−/− embryos showed large amounts of tracer leakage in the brain
parenchyma (arrowheads). b, Capillaries (arrows) surrounded by tracer-filled
brain parenchyma cells (arrowheads) in Mfsd2a−/− cortex. Quantification of
tracer-filled parenchyma cells in control versus Mfsd2a−/− cortical plates
(bottom panel, n = 7 embryos per genotype). c, Spectrophotometric

Figure 3 | Mfsd2a is selectively expressed in BBB-containing CNS
vasculature. a, At E13.5, Mfsd2a expression in cortical endothelium was
~80-fold higher than in lung endothelium (microarray analysis, mean ± s.d.).
b-d, Specific Mfsd2a expression in BBB-containing CNS vasculature (blue,
Mfsd2a in situ hybridization; green, vessel staining (PECAM) adjacent
sections). b, Mfsd2a expression at E15.5 in CNS vasculature (sagittal view of
brain and spinal cord, arrows), but not in non-CNS vasculature (asterisk).
c, Mfsd2a expression at E15.5 in BBB vasculature (cortex coronal view, for
example, striatum, arrow), but not in non-BBB CNS vasculature (choroid
plexus, dashed line). d, High-magnification coronal view of Mfsd2a expression
in BBB-containing CNS vasculature but not in vasculature of the choroid
plexus (left, dashed line), or outer meninges or skin (right, red arrows).
e-g, Immunohistochemical staining of Mfsd2a protein shows specific
expression in CNS endothelial cells (red, Mfsd2a; green, claudin-5 or lectin
(endothelium); blue, DAPI (nuclei); grey, Pdgfrβ (pericytes)). e, Mfsd2a
expression in the brain vasculature of wild-type mice (top panel), but not of
Mfsd2a−/− mice (bottom panel). f, Mfsd2a expression only in claudin-5-
positive endothelial cells (arrow; endothelial nucleus is indicated by an asterisk)
but not in adjacent pericytes (arrowhead; pericyte nucleus is indicated by a
double asterisk). g, Lack of Mfsd2a expression in choroid plexus vasculature
(fourth ventricle coronal view, dashed line), as opposed to the prominent
Mfsd2a expression in cerebellar vasculature. n = 3 embryos (3 litters per age).


that the BBB leakiness observed in Mfsd2a−/− mice was not caused by 
opening of tight junctions, but rather by increased transcellular traf- 
ficking across the endothelial cytoplasm. 

Studies using pericyte-deficient genetic mouse models have shown 
that pericytes can also regulate BBB integrity. These mice had increased 
vesicle trafficking without obvious junction defects, similar to our 
observations in Mfsd2a−/− mice. We therefore examined the possibilities 
that Mfsd2a may regulate CNS endothelial transcytosis by modulating 
pericyte function or that the effect of pericytes on endothelial 
transcytosis is mediated by Mfsd2a. First, pericyte coverage, attachment to the 
capillary wall, and pericyte ultrastructure and positioning relative to 
endothelial cells were normal in Mfsd2a−/− mice (Extended Data Fig. 6). 
These data, together with the lack of Mfsd2a expression in pericytes, 
suggest that the increased transcytosis observed in Mfsd2a−/− endothelial 
cells is not secondary to pericyte abnormalities. Second, a genetic 
reduction in pericyte coverage can influence endothelial gene expression 
quantification of 10-kDa dextran-tracer from cortical extracts of P90 mice, 16 h 
post intravenous injection, indicating that BBB leakiness in Mfsd2a−/− mice 
persists into adulthood (n = 3 mice per genotype). d, Mfsd2a−/− mice 
exhibit normal vascular patterning. No abnormalities were found in cortical 
vascular density, branching and capillary diameter (E15.5; green, PECAM). 
Quantification of wild-type and Mfsd2a−/− samples (n = 4 embryos per 
genotype). All data are mean ± s.e.m. MUT, mutant; N.S., not significant; WT, 
wild type. *P < 0.05 (Mann–Whitney U-test). 

Figure 5 | Mfsd2a is required specifically to suppress transcytosis in brain 
endothelium to maintain BBB integrity. Electron-microscopy examination 
of BBB integrity. a, Embryonic Mfsd2a−/− endothelium (E) showed no overt 
tight-junction ultrastructural defect (left, normal ‘kissing points’, small arrows). 
The vessel lumen (L) in HRP-injected adult mice was filled with electron-dense 
3,3′-diaminobenzidine (DAB) reaction (black) that diffused into intercellular 
clefts but stopped sharply at the junction without parenchymal leakage (right, 
arrows). b, Increased vesicular activity in embryonic Mfsd2a−/− endothelium 
(E17.5). Left, wild-type endothelium displayed very few vesicles (arrow). 
Right, Mfsd2a−/− endothelium contained many vesicles of various types: 


profiles. Therefore we analysed published microarray data of two 
pericyte-deficient mouse models and found a dramatic downregulation 
of Mfsd2a in these mice, with a direct correlation between the reduction 
of Mfsd2a gene expression and the degree of pericyte coverage (Extended 
Data Fig. 7a). Furthermore, immunostaining for Mfsd2a in Pdgfb^ret/ret 
mice revealed a significant decrease in Mfsd2a protein levels in endothelial 
cells that are not covered by pericytes (Extended Data Fig. 7b–d). 
Therefore, it is plausible that the increased vesicular trafficking phenotype 
observed in pericyte-deficient mice is, at least in part, mediated by 
Mfsd2a, and that endothelial–pericyte interactions control the expression 
of Mfsd2a, which in turn controls BBB integrity. 

We demonstrate that Mfsd2a is required to suppress endothelial 
transcytosis in the CNS. Because of Mfsd2a’s involvement in human 
trophoblast cell fusion and of our genetic evidence for its role in 
suppressing transcytosis, we propose that Mfsd2a serves as a cell-surface 
molecule to regulate membrane fusion or trafficking. Indeed, from 
immuno-electron-microscopy examination, Mfsd2a protein was found 
in the luminal plasma membrane and associated with vesicular struc- 
tures in cerebral endothelial cells, but not in tight junctions (Extended 
Data Fig. 8). At present, it is not clear whether the reported transporter 
function of Mfsd2a is related to its role in BBB formation. 

BBB breakdown has been reported in the aetiology of various neurological 
disorders, and two separate Mfsd2a-deficient mouse lines were 
reported to exhibit neurological abnormalities, such as ataxic behaviour. 
Finding a novel physiological role of Mfsd2a may provide a valuable 
tool to address how a non-functional BBB could affect brain develop- 
ment. In addition, our finding also highlights the importance of the 
transcytotic mechanism in BBB function, whereas most previous atten- 
tion has been focused on potential BBB leaks through intercellular junc- 
tions. Indeed, increased numbers of pinocytotic vesicles were observed 
following acute exposure to external stress inducers in animal models, 
and have also been observed in human pathological conditions. It will 
be interesting to examine whether Mfsd2a is involved in these pathological 
and acute assault situations. We cannot be certain that the elevated 
levels of transcytosis in Mfsd2a−/− mice were not due to some 
form of acute cellular stress, but it is very unlikely. This is because under 
stress, either cells respond to restore homeostasis or cell death occurs. 
However, increased transcytosis in Mfsd2a−/− mice persists from embryonic 
stages to adulthood, and up to 6 months of age these mice exhibit 
no sign of vascular degeneration (Extended Data Fig. 5c). Our identification 
of a key molecular player in BBB formation may also aid efforts to 
develop therapeutic approaches for efficient drug delivery to the CNS. 
As an accessible cell surface molecule, Mfsd2a is poised to be a poten- 
tial therapeutic target for BBB restoration and manipulation. 


METHODS SUMMARY 


The lowest volume of 10-kDa dextran tetramethylrhodamine, lysine fixable (D3312 
Invitrogen) that still facilitated full perfusion was injected into the embryonic liver, 
while keeping the embryo connected to the maternal blood circulation through the 
umbilical cord. After 3 minutes of tracer circulation, embryonic heads were fixed 
by immersion in 4% paraformaldehyde (PFA) overnight at 4 °C, cryopreserved in 
30% sucrose and frozen in TissueTek OCT (Sakura). Sections of 12 µm were then 
collected and post-fixed in 4% PFA at room temperature (20–25 °C) for 15 min, 
washed in PBS and co-stained with either α-PECAM antibody or with isolectin B4 
to visualize blood vessels (see Methods for details). P90 HRP injection and E17.5 
cortex capillaries transmission electron microscopy (TEM) imaging was done as 
described previously. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS 


Animals. Wild-type Swiss-Webster mice (Taconic Farms) were used for embryonic 
BBB functionality assays and expression profiles. Homozygous Tie2-GFP trans- 
genic mice (Jackson laboratory, strain 003658) were used for BBB transcriptional 
profiling. Mfsd2a-null mice (Mouse Biology Program, University of California, 
Davis; MMRRC strain 032467-UCD, B6;129S5-Mfsd2atm1Lex/Mmucd) were 
maintained on a C57Bl/6;129SVE mixed background and used for testing the 
involvement of Mfsd2a in barrier genesis. Mfsd2a-null mutant mice were genotyped 
using the following PCR primers: 5′-CCTGGTTTGCTAAGTGCTAGC-3′ and 
5′-GTTCACTGGCTTGGAGGATGC-3′, which provide a 210-bp product for the 
Mfsd2a wild-type allele; and 5′-CACTTCCTAAAGCCTTACTTC-3′ and 
5′-GCAGCGCATCGCCTTCTATC-3′, which provide a 301-bp product for the 
Mfsd2a-knockout allele. 
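
As an illustration of how the two primer pairs distinguish genotypes, the following minimal Python sketch calls a genotype from the band sizes read off a gel. Only the 210-bp (wild-type) and 301-bp (knockout) product sizes come from the text; the function name `call_genotype` and the 15-bp sizing tolerance are hypothetical choices made for this example and are not part of the published protocol.

```python
WT_BP = 210  # product size for the wild-type allele (from the text)
KO_BP = 301  # product size for the knockout allele (from the text)


def call_genotype(band_sizes_bp, tolerance_bp=15):
    """Return 'WT', 'HET', 'KO' or 'no call' from observed band sizes.

    tolerance_bp is an assumed gel-sizing tolerance, not a published value.
    """
    has_wt = any(abs(b - WT_BP) <= tolerance_bp for b in band_sizes_bp)
    has_ko = any(abs(b - KO_BP) <= tolerance_bp for b in band_sizes_bp)
    if has_wt and has_ko:
        return "HET"      # both alleles amplified
    if has_wt:
        return "WT"       # only the 210-bp product
    if has_ko:
        return "KO"       # only the 301-bp product
    return "no call"      # neither expected product was seen


if __name__ == "__main__":
    print(call_genotype([208, 300]))
```

A sample showing both products would be called heterozygous, and a lane with neither expected band is flagged rather than guessed.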

Pregnant mice were obtained following overnight mating (day of vaginal plug 
was defined as embryonic day 0.5). 

All animals were treated according to institutional and US National Institutes of 
Health (NIH) guidelines approved by the Institutional Animal Care and Use Com- 
mittee (IACUC) at Harvard Medical School. 

Immunohistochemistry. Tissues were fixed with 4% paraformaldehyde (PFA) at 
4 °C overnight, cryopreserved in 30% sucrose and frozen in TissueTek OCT (Sakura). 
Tissue sections were blocked with 5% goat serum, permeabilized with 0.5% Triton 
X-100, and stained with the following primary antibodies: α-PECAM (1:500; 553370, 
BD Pharmingen), α-Claudin5 (1:400; 35-2500, Invitrogen), α-Mfsd2a (1:500; 
Cell Signaling Technologies (under development)), α-Pdgfrβ (1:100; 141402, eBioscience), 
α-CD31 (1:100; 558744, BD Pharmingen), α-SMA (1:100; C6198, Sigma 
Aldrich), followed by 568/488 Alexa Fluor-conjugated secondary antibodies 
(1:300–1:1000, Invitrogen) or with Isolectin B4 (1:500; 121411, Molecular Probes). Slides 
were mounted in Fluoromount G (EMS) and visualized by epifluorescence, light, 
or confocal microscopy. 

In situ hybridization. Tissue samples were frozen in liquid nitrogen and embedded 
in TissueTek OCT (Sakura). Sections (18 µm) were hybridized with a 
digoxigenin (DIG)-labelled mouse Mfsd2a antisense riboprobe (1,524–2,024 bp, 
NM_029662) at 60 °C overnight. A sense probe was used to ensure signal specificity. For 
detection, signals were developed using anti-DIG antibody conjugated with alkal- 
ine phosphatase (Roche). After antibody treatment, sections were incubated with 
BM Purple AP Substrate (Roche). 

Embryonic BBB permeability assay. The method is based on the well-established 
adult BBB dye-injection assay, with special considerations for the injection site and 
volume to cater for the nature of embryonic vasculature. 

Four major modifications were made: first, embryos were injected while still attached 
via the umbilical cord to the mother’s blood circulation, minimizing abrupt changes 
in blood flow. Deeply anaesthetized pregnant mice were used. 

Second, taking advantage of the sinusoidal, fenestrated and most permeable liver 
vasculature, dye was injected using a Hamilton syringe into the embryonic liver and 
was taken into the circulation in a matter of seconds. 

Third, dye volume was adjusted to a minimum that still allows detection in all 
CNS capillaries after 3 min of circulation. High-fluorescence-intensity dye enables the 
use of small volumes and facilitates detection at the single-capillary level (10-kDa 
dextran-tetramethylrhodamine, lysine fixable, 4 mg ml−1 (D3312 Invitrogen), 1 µl 
for E13.5, 2 µl for E14.5, 5 µl for E15.5). 

Fourth, traditional perfusion fixation was omitted, again to prevent damage to 
capillaries. Instead, fixable dyes were used to allow reliable immobilization of the 
dye at the end of the circulation time (relatively small embryonic brain facilitates 
immersion fixation). 
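
The concentration and volumes quoted in the third modification imply a fixed mass of tracer per embryo at each stage. The sketch below works through that arithmetic, reading the injected volumes as microlitres and using the stated 4 mg ml−1 concentration. The names `DYE_CONC_MG_PER_ML`, `VOLUME_UL_BY_STAGE` and `dye_mass_ug` are invented for illustration.

```python
# Stated tracer concentration; mg per ml is numerically equal to µg per µl.
DYE_CONC_MG_PER_ML = 4.0

# Injected volume per embryonic stage, read as microlitres (assumption).
VOLUME_UL_BY_STAGE = {"E13.5": 1.0, "E14.5": 2.0, "E15.5": 5.0}


def dye_mass_ug(stage):
    """Mass of tracer (µg) delivered to one embryo at the given stage."""
    return DYE_CONC_MG_PER_ML * VOLUME_UL_BY_STAGE[stage]


if __name__ == "__main__":
    for stage in VOLUME_UL_BY_STAGE:
        print(stage, dye_mass_ug(stage), "µg")
```

Under these assumptions the delivered mass scales directly with the stage-specific volume, which is the quantity the protocol minimizes.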

Embryonic heads were fixed by immersion in 4% PFA overnight at 4 °C, 
cryopreserved in 30% sucrose and frozen in TissueTek OCT (Sakura). Sections of 12 µm 
were then collected and post-fixed in 4% PFA at room temperature for 15 min, 
washed in PBS and co-stained with either α-PECAM antibody or with isolectin B4 
to visualize blood vessels. All embryos from each litter were injected blind before 
genotyping. 

Postnatal and adult BBB permeability assay. P2–P5 pups were deeply anaesthetized 
and three methods were used: the first method involved injection of 10 µl 
of 10-kDa or 70-kDa dextran tetramethylrhodamine (4 mg ml−1; D3312 Invitrogen) 
into the left ventricle with a Hamilton syringe. After 5 min of circulation, brains 
were dissected and fixed by immersion in 4% PFA at 4 °C overnight, cryopreserved 
in 30% sucrose and frozen in TissueTek OCT (Sakura). Sections of 12 µm were 
collected and post-fixed in 4% PFA at room temperature for 15 min, washed in PBS 
and co-stained to visualize blood vessels with either α-PECAM primary antibody 
(1:500; 553370, BD Pharmingen), followed by 488-Alexa Fluor conjugated secondary 
antibody (1:1000, Invitrogen) or with isolectin B4 (1:500; 121411, Molecular 
Probes). 

The second method involved injection of 10 µl of HRP type II (5 mg ml−1; 
P8250-50KU Sigma-Aldrich) into the left heart ventricle with a Hamilton syringe. 
After 5 min of circulation, brains were dissected and immersion fixed in 2% 
glutaraldehyde in 4% PFA in cacodylate buffer (0.1 M, pH 7.3) at room temperature 
for 1 h then at 4 °C for 3 h, then washed in cacodylate buffer overnight. Cortical 
vibratome sections (100 µm) were processed in a standard DAB reaction. 

The third method involved the use of EZ-link NHS-sulfo-biotin as a tracer, as 
described previously. 

Imaging. Nikon Eclipse 80i microscope equipped with a Nikon DS-2 digital camera 
was used to image HRP tracer experiments, vasculature density and pericyte cov- 
erage comparisons and expression analyses. Zeiss LSM 510 META upright confocal 
microscope was used to image Dextran and NHS-sulfo-biotin BBB permeability 
assays. A Nikon FluoView FV 1000 laser scanning confocal microscope and a Leica 
SP8 laser scanning confocal microscope were used for imaging Mfsd2a and pericyte 
marker immunohistochemistry. Images were processed using Adobe Photoshop 
and ImageJ (NIH). 

Morphometric analysis of vasculature. Coronal sections (25-µm thick) of E15.5, 
P4 and P70 brains were immunostained for PECAM. For vascular density and branching, 
confocal images were acquired with a Nikon FluoView FV 1000 laser scanning 
confocal microscope and maximal projection images (5 per animal) were used for 
quantifications. The number of branching points was manually counted. Capillary 
density was quantified using MetaMorph software (Universal Imaging, Downing- 
town, Pennsylvania) by measuring the area occupied by PECAM-positive vessels 
per cortical area. The mean capillary diameter was measured manually in ImageJ from 
cross-sectional vascular profiles (20 per animal) on micrographs (5-7 per animal) 
taken under a ×60 objective with a ×2 digital zoom. 
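
The capillary-density measure described above (area occupied by PECAM-positive vessels per cortical area) was obtained by the authors in MetaMorph. The same quantity can be sketched in a few lines of Python, assuming the image is available as a 2-D grid of pixel intensities and that a single intensity threshold separates vessel from background. The function name and the threshold value are hypothetical and do not reproduce MetaMorph's procedure.

```python
def vessel_area_fraction(image, threshold):
    """Fraction of pixels with intensity at or above `threshold`.

    `image` is a list of rows of pixel intensities covering the cortical
    region of interest; `threshold` is an assumed, user-chosen cut-off.
    """
    total = sum(len(row) for row in image)
    if total == 0:
        raise ValueError("image contains no pixels")
    positive = sum(1 for row in image for px in row if px >= threshold)
    return positive / total


if __name__ == "__main__":
    toy_image = [[0, 200], [150, 10]]   # made-up intensities
    print(vessel_area_fraction(toy_image, threshold=100))
```

Applying one threshold to every image mirrors the practice, stated elsewhere in the Methods, of using the same acquisition parameters and threshold across samples.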

For artery distribution quantification, 25-µm-thick sections (P60) were stained 
for smooth muscle actin (SMA) and PECAM. The proportion of PECAM-positive 
brain vessels with artery (SMA) identity was quantified using MetaMorph and 
expressed as percent of controls. Quantification was carried out blind. 

Quantification of cortical-vessel pericyte coverage. Pericyte coverage of cortex 
vessels in Mfsd2a−/− and wild-type littermate control mice was quantified by 
analysing the proportion of total claudin-5-positive endothelial length also positive for 
the pericyte markers CD13 or Pdgfrβ. Immunostaining was performed on 20-µm 
sections of P5 cortex. In each animal, 20 images of 10 different sections were 
analysed. Microvasculature was found to be completely covered by pericytes in both 
control and Mfsd2a−/− mice and therefore no error bars are presented for the 
average pericyte coverage in Extended Data Fig. 6a, b (n = 3). All the analysis was done 
with ImageJ (NIH). Quantification was carried out blind. 
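
The coverage measure defined above is a length ratio: marker-positive endothelial length divided by total claudin-5-positive length. A minimal sketch of that ratio follows, assuming each traced vessel segment has been recorded as a length together with a yes/no flag for pericyte-marker overlap. The data layout and the name `pericyte_coverage` are assumptions for illustration; the authors performed the measurement in ImageJ.

```python
def pericyte_coverage(segments):
    """Proportion of endothelial length that is also pericyte-marker positive.

    `segments` is a list of (length_um, covered) pairs, where `covered`
    is True when the segment overlaps CD13 or Pdgfrβ staining.
    """
    total = sum(length for length, _ in segments)
    if total <= 0:
        raise ValueError("total endothelial length must be positive")
    covered = sum(length for length, is_covered in segments if is_covered)
    return covered / total


if __name__ == "__main__":
    traced = [(10.0, True), (30.0, True), (10.0, False)]  # made-up lengths
    print(pericyte_coverage(traced))
```

A result of 1.0 for every animal in both groups would correspond to the complete coverage reported in the text, which is why no error bars were needed there.
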

Quantification of vessel leakage. Epifluorescence images of sections from 
tracer-injected embryos, co-stained with lectin, were analysed manually with ImageJ (NIH). 
Coronal cortical sections (12 µm) of the same rostrocaudal position were used for 
the analysis. The same acquisition parameters were applied to all images and the 
same threshold was used. Tracer-positive cells found outside a vessel (parenchyma) 
were used as a parameter for leakage. For each embryo, at least 20 sections of a fixed 
lateral cortical plate area were scored. Four arbitrary leakage groups were classified 
based on the number of tracer-positive parenchyma cells per section (0, 1–5, 5–10 and 
10–40). Average representation of each leakage group was calculated for Mfsd2a−/− 
and control embryos. Quantification was carried out blind. 
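
The scoring scheme above can be sketched as a binning step followed by averaging across embryos. The group labels are taken from the text, but the stated ranges share their end points (5 and 10), so this example assumes the groups are 0, 1 to 5, 6 to 10, and 11 or more; that boundary choice, like the function names, is an assumption rather than the authors' rule.

```python
from collections import Counter

GROUPS = ("0", "1-5", "5-10", "10-40")  # labels as given in the text


def leakage_group(n_cells):
    """Assign one section to a leakage group (boundaries are assumed)."""
    if n_cells < 0:
        raise ValueError("cell count cannot be negative")
    if n_cells == 0:
        return "0"
    if n_cells <= 5:
        return "1-5"
    if n_cells <= 10:
        return "5-10"   # assumed to mean 6 to 10
    return "10-40"      # assumed to mean 11 and above


def group_fractions(counts_per_section):
    """Fraction of one embryo's sections falling in each group."""
    tally = Counter(leakage_group(n) for n in counts_per_section)
    total = len(counts_per_section)
    return {g: tally[g] / total for g in GROUPS}


def mean_group_fractions(embryos):
    """Average each group's fraction over embryos (one list per embryo)."""
    per_embryo = [group_fractions(counts) for counts in embryos]
    return {g: sum(f[g] for f in per_embryo) / len(per_embryo) for g in GROUPS}


if __name__ == "__main__":
    print(mean_group_fractions([[0, 0, 3, 12], [0, 7, 7, 20]]))  # made-up counts
```

Averaging per-embryo fractions, rather than pooling all sections, keeps each embryo as the unit of analysis, in line with the statistical treatment described in the Methods.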

Spectrophotometric quantification of 10-kDa fluoro-ruby-dextran tracer was carried 
out from cortical extracts, 16 h after tail-vein injections in adult mice, as described 
previously. 

Transmission electron microscopy. TEM imaging of P90 HRP injection and E17.5 
cortex capillaries was carried out as described previously. HRP (10 mg per 20 g; 
Sigma Aldrich, HRP type II) was dissolved in 0.4 ml of PBS and injected into the 
tail veins of deeply anaesthetized P90 mice. After 30 min of HRP circulation, brains 
were dissected and fixed by immersion in a 0.1 M sodium-cacodylate-buffered 
mixture (5% glutaraldehyde and 4% PFA) for 1 h at room temperature followed by 
5 h in PFA at 4 °C. Following fixation, the tissue was washed overnight in 0.1 M 
sodium-cacodylate buffer and then cut in 50-µm-thick free-floating sections using 
a vibratome. Sections were incubated for 45 min at room temperature in 0.05 M 
Tris-HCl pH 7.6 buffer, containing 5.0 mg per 10 ml of 3,3′-diaminobenzidine 
(DAB, Sigma Aldrich) with 0.01% hydrogen peroxide. Sections were then post- 
fixed in 1% osmium tetroxide and 1.5% potassium ferrocyanide and dehydrated 
and embedded in epoxy resin. E17.5 samples were processed as the P90 samples 
without HRP injection and with longer fixation times (2–3 days at room temperature). 
Ultrathin sections (80 nm) were then cut from the block surface, collected 
on copper grids, stained with Reynolds’ lead citrate and examined under a 1200EX 
electron microscope (JEOL) equipped with a 2k CCD digital camera (AMT). 

Immunogold labelling for electron microscopy. Mice were deeply anaesthetized 
and perfused through the heart with 30 ml of PBS followed by 150 ml of a fixative 
solution (0.5% glutaraldehyde in 4% PFA prepared in 0.1 mM phosphate buffer, 
pH 7.4), and then by 100 ml of 4% PFA in phosphate buffer. The brain was removed 
and post-fixed in 4% PFA (30 min, 4 °C) and washed in PBS. Coronal brain sections 
(50-µm thick) were cut on the same day with a vibratome and processed free floating. 
Sections were immersed in 0.1% sodium borohydride in PBS (20 min, room 
temperature), rinsed in PBS and pre-incubated (2 h) in a blocking solution of 
PBS containing 10% normal goat serum, 0.5% gelatine and 0.01% Triton. Incuba- 
tion (24 h, 20-25 °C) with rabbit anti-Mfsd2a (1:100; Cell Signaling Technologies 
(under development)) primary antibody was followed by rinses in PBS and incu- 
bation (overnight, 20-25 °C) in a dilution of gold-labelled goat anti-rabbit IgGs 
(1:50; 2004, Nanoprobes). After washes in PBS and sodium acetate, the size of 
immunogold particles was silver-enhanced and sections rinsed in phosphate buf- 
fer before processing for electron microscopy. 

Statistical analysis. Comparison between wild-type and Mfsd2a−/− pericyte coverage 
and spectrophotometric quantification of 10-kDa fluoro-ruby-dextran tracer 
leakage was performed by a Mann–Whitney U-test (appropriate for small sample 
size; each embryo was considered as a sample). An unpaired Student’s t-test was used 
(GraphPad Prism 4 Software) for comparison between wild-type and Mfsd2a−/− 
for vascular density, artery distribution, number of vesicular types, mean capillary 
diameter and Mfsd2a expression in pericyte-deficient mice. P < 0.05 was considered 
significant (StatXact, Cytel Software Corporation, Cambridge, Massachusetts, USA). 
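
For readers unfamiliar with the Mann–Whitney U-test named above, the sketch below computes the U statistic and an exact two-sided P value by enumerating every possible assignment of the pooled ranks to the first group. It is a plain-Python illustration for small samples only, with tied values given averaged ranks; it is not the StatXact procedure the authors used, and the function names and example numbers are invented.

```python
from itertools import combinations


def midranks(values):
    """1-based ranks of `values`, with tied values sharing their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1          # mean of positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def mann_whitney_exact(x, y):
    """Return (U for x, exact two-sided P) by full enumeration."""
    n1, n2 = len(x), len(y)
    ranks = midranks(list(x) + list(y))

    def u_for(indices):
        return sum(ranks[i] for i in indices) - n1 * (n1 + 1) / 2

    u_obs = u_for(range(n1))
    centre = n1 * n2 / 2                     # expected U under the null
    dev_obs = abs(u_obs - centre)
    hits = total = 0
    for idx in combinations(range(n1 + n2), n1):
        total += 1
        if abs(u_for(idx) - centre) >= dev_obs - 1e-12:
            hits += 1
    return u_obs, hits / total


if __name__ == "__main__":
    # Made-up, fully separated groups of four: U = 0 and P = 2/70.
    print(mann_whitney_exact([1, 2, 3, 4], [5, 6, 7, 8]))
```

Enumeration is practical only for small groups because the number of rank assignments grows combinatorially with sample size.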

Transcriptional profiling. E13.5 Tie2-GFP embryos were micro-dissected for cortex 
and lungs. Cortex tissue was carefully cleared of the meninges and choroid plexus. 
FACS purification of GFP-positive cells and GeneChip analysis was performed as 
described previously. RNA was purified with Arcturus PicoPure RNA isolation 
kit (Applied Biosystems), followed by NuGEN Ovation V2 standard linear amplification 
and hybridization to Affymetrix Mouse Genome 430 2.0 Array. All material 
from a single litter (10-13 embryos) was pooled and considered as a biological rep- 
licate. Four biological replicates were used. Each biological replicate represents 
purification from different litters performed on different days. 

Transcriptional profile analysis of pericyte-deficient mice. Expression data from 
a published study of pericyte-deficient mice were obtained from the Gene Expression 
Omnibus (http://www.ncbi.nlm.nih.gov/geo, accession number GSE15892). All 
microarrays were analysed using the MAS5 probe set condensation algorithm with 
Expression Console software (Affymetrix). P values were determined using a 
two-tailed Student’s t-test (n = 4). 

Mfsd2a protein expression in Pdgfb^ret/ret mice. Brain samples from P10–P14 
mice and controls were kindly provided by C. Betsholtz. Sample processing and 
immunohistochemistry was carried out as described for all other samples in our study. 
Mfsd2a staining quantification was carried out with 12-µm cortical sagittal sections. 
Confocal images were acquired with a Nikon FluoView FV 1000 laser scanning 
confocal microscope. Quantification of mean grey value per vascular profile 
was done with ImageJ (NIH) by outlining vascular profiles according to lectin staining 
and measuring Mfsd2a intensity in these areas. In all images, Pdgfrβ antibody 
staining was used to test for the presence of pericytes in quantified vessels. n = 2 animals 
per genotype, 60 images quantified of at least 600 vascular profiles per animal. 
Quantification was carried out blind. 

Extended Data Figure 1 | Diagram illustrating two unique BBB properties 
of CNS endothelial cells. Compared to the endothelial cells from the rest of the 
body, CNS endothelial cells that possess a BBB are characterized by highly 
specialized tight junctions sealing the space between adjacent cells (1), and an 
unusually low rate of transcytosis from the vessel lumen to the brain 
parenchyma (2). 


©2014 Macmillan Publishers Limited. All rights reserved 


[Extended Data Figure 2 panel labels: E14.5 hindbrain; E14.5 midbrain; E13.5 and E14.5 cortex, dorsal (dorsal-medial) and ventral (ventral-lateral) regions; channels: lectin, 10-kDa tracer, overlay.] 


Extended Data Figure 2 | Spatial and temporal BBB maturation across 
brain regions and cortical regions. a, b, Embryonic BBB develops in a caudal- 
to-rostral spatial pattern. Dextran-tracer (10 kDa) injection revealed that the 
BBB is already functional in the hindbrain (a) and midbrain (b) at E14.5, a time- 
point at which cortical BBB is still leaky. Epifluorescence (low magnification) 
and confocal (high magnification) images of brain sections from injected 
embryos are shown. As illustrated in both a and b, tracer was confined to blood 
vessels (arrows). c-e, Cortical BBB develops in a ventrolateral to dorsomedial 
spatial pattern. c, Diagram of the embryonic cortex indicating dorsal-medial 
and ventral-lateral cortical regions illustrated in d and e. d, At E13.5, the BBB of 
the dorsal cortex (top panels) is not fully formed, as evidenced by little tracer 
inside the blood vessels (arrow) and tracer-filled parenchymal cells 
(arrowheads). In the ventral cortex (bottom panels), capillaries were better 
sealed, showing more tracer within the lumen (arrow) and less tracer in the 
brain parenchyma (arrowhead). e, At E14.5, the capillaries in the dorsal cortex 
(top panels) were still leaky, with little tracer inside the capillaries (arrow) and 
tracer-stained surrounding parenchyma (arrowheads). At the same age, the 
BBB of the ventral cortex (bottom panels) was already functional, with all tracer 
confined to the capillaries (arrow). Green, lectin; red, 10-kDa tracer. n = 6 
embryos from 3 litters for each age. 

Extended Data Figure 3 | Mfsd2a protein is selectively expressed in 
BBB-containing CNS vasculature of both embryos and adults. 
Immunohistochemical staining of Mfsd2a protein demonstrating its specific 
expression in BBB-containing CNS vasculature. Red, Mfsd2a; green, PECAM 
(endothelium); blue, DAPI (nuclei). a, At E13.5, the time of BBB establishment, 
Mfsd2a expression was detected in the brain (top panels) but not in the 
lung vessels (bottom panels), confirming the microarray data. b, At E15.5, the 
first developmental time-point of BBB functionality, Mfsd2a expression 
was detected in spinal cord (top panels) but not in lung (middle panels) or liver 
vessels (bottom panels). c, Mfsd2a selective expression in BBB-containing 
vessels persists in adult mice (P90) as shown in brain vessels (top panels) but 
not in lung or liver (bottom panels). d, Mfsd2a is expressed in cerebral vessels 
(arrow) but not in pial vessels (arrowheads). Low (top panel) and high 
magnification (bottom panel) of pial-cerebral boundary (dotted line) of P5 
dorsal cortex (wild-type mice). n = 3 embryos from 3 litters for each age. 

Extended Data Figure 4 | The leaky BBB phenotype in Mfsd2a−/− mice 
persists after birth and is not restricted to carbohydrate-based tracers. 
a, b, Injection of two non-carbohydrate-based tracers with different molecular 
weight and different molecular compositions at P4 revealed a persistent leaky 
barrier phenotype in mice lacking Mfsd2a. a, The small-molecular-weight 
tracer sulfo-NHS-biotin (~550 Da) was confined to vessels in wild-type 
controls (upper panels), whereas it leaked out of the vessels (arrowheads) in 
Mfsd2a−/− mice (lower panels). Green, lectin; red, tracer. b, The 
high-molecular-weight protein tracer HRP (~44 kDa) was confined to vessels in 
control mice at P4 (left), whereas in mice lacking Mfsd2a (right), tracer was 
diffusing into the brain parenchyma (arrowheads). Brown, visualization of 
HRP in light microscopy by DAB reaction. Stainings were carried out on 
100-µm cortical sections of tracer-injected animals. n = 4 pups per genotype 
from 3 litters. c, d, BBB leakage of tracer with 70 kDa molecular weight is 
observed in postnatal Mfsd2a−/− mice. Green, lectin or PECAM (vessels); 
red, tracer. The 10-kDa (c) and 70-kDa (d) fluoro-ruby-dextran tracers were 
confined to vessels in control mice (top panels), but not in Mfsd2a−/− mice 
(bottom panels), where tracer was taken up by brain parenchymal cells 
(arrowheads). Stainings were carried out on 12-µm cortical sections of 
tracer-injected animals. n = 3 pups per genotype from 3 litters. 

Extended Data Figure 5 | Perinatal and adult mice lacking Mfsd2a do not 
display changes in cerebrovascular network properties or signs of vascular 
degeneration. a, No abnormalities were found in cortical capillary density and 
vascular branching (top panels) as well as capillary diameter (bottom panels) 
of postnatal (P4, left) and adult (P70, right) Mfsd2a−/− mice. Images of 
vascular staining in coronal cortical sections and high-magnification images 
of cross-section profiles of capillaries (green, PECAM) and quantification. 
Data are mean ± s.e.m. n = 3 animals per genotype, 20 sections per animal. 
b, Immunostaining for smooth-muscle actin (arterial identifier, arrows) 
revealed no abnormalities in arterial distribution and specification in 
Mfsd2a−/− mice. Images of coronal cortical sections (left, green, PECAM; red, 
SMA) and quantification (right). Data are mean ± s.e.m. n = 3 animals per 
genotype, 20 sections per animal. c, Electron-microscopy examination of older 
Mfsd2a−/− mice did not reveal signs of cerebrovascular degeneration. Left, 
the overall capillary structure (for example, cell size, shape of the nucleus, 
thickness of the vessel wall, basement membrane integrity and pericyte 
attachment) did not differ between wild-type and mutant mice. Right, at 
higher magnification, normal features, such as pericyte (asterisk) attachment 
within a normal basement membrane (between arrows), could be observed 
in mice lacking Mfsd2a. 

Extended Data Figure 6 | Pericyte coverage, attachment and ultrastructure 
are normal in Mfsd2a−/− mice. a, b, Mfsd2a−/− mice exhibit normal pericyte 
coverage. Co-staining of endothelium (claudin-5, red) and pericytes (CD13 in 
a and Pdgfrβ in b, green) revealed no overt difference in pericyte coverage of 
dorsal cortex vessels between wild-type (top row) and Mfsd2a−/− (bottom row) 
mice at P5. Quantification of vascular coverage in both a and b showed no 
significant difference between wild-type and Mfsd2a−/− samples (P > 0.5). 
Data are mean ± s.e.m. n = 3 pups per genotype, 20 sections per animal. 
c, E17.5 Mfsd2a−/− mice exhibit normal pericyte–endothelial attachment. 
High magnification images of capillary cross sections co-staining for 
endothelium (Glut1, red) and pericytes (Pdgfrβ, green) revealed no difference 
in pericyte–endothelial relationships between wild-type (top panels) and 
Mfsd2a−/− (bottom panels) mice (endothelial nucleus and pericyte nucleus 
are indicated by single and double asterisks, respectively). d, Electron 
micrographs of longitudinal capillary sections revealed that pericytes had 
normal appearance and were well positioned on the vessel walls in Mfsd2a−/− 
adult mice; pericytes were adjacent to endothelial cells and shared a common 
basement membrane. L, lumen; P, pericyte. 

Extended Data Figure 7 | Mfsd2a gene expression and Mfsd2a protein levels 
are downregulated in mouse models of reduced pericyte coverage. 
a, Analysis of microarray data showed high levels of Mfsd2a expression in 
wild-type adult brain microvasculature, but a significant decrease in levels of 
Mfsd2a expression in mice that have reduced pericyte coverage at the BBB. 
Pdgfb^ret/ret mice (mouse model 1), where PdgfB binding to heparan sulphate 
proteoglycans was disrupted, exhibited a major loss of pericyte coverage (74% 
reduction) and showed a dramatic decrease in Mfsd2a expression (74% 
reduction) compared to that of littermate controls. Similarly, Tie2Cre/R26P^+/0, 
Pdgfb−/− mice (mouse model 2), in which Pdgfb-null alleles were 
complemented by one copy of human PDGFB transgene, showed a less 
dramatic loss of pericyte coverage (60% reduction) and a smaller decrease in 
Mfsd2a expression (53% reduction). **P = 0.004, ***P = 1 x 10°). Bars 
reflect normalized signal of the Mfsd2a probe (1428223_at) in adult brain or 
cortex microvascular fragments (a.u.). Data are mean ± s.d. of 4 biological 
replicates. b, Mfsd2a immunostaining in cerebrovasculature of postnatal 
Pdgfb^ret/ret mice and littermate controls (Pdgfb^ret/+) revealed reduced Mfsd2a 
protein expression in endothelial cells that are not covered by pericytes. Cross 
section: red, Mfsd2a; green, claudin-5 (endothelium); blue, DAPI (nuclei); grey, 
Pdgfrβ (pericyte). Reduced Mfsd2a signal was observed in endothelial cells of 
capillaries not covered with pericytes in Pdgfb^ret/ret mice (arrow, bottom 
panels), whereas strong Mfsd2a signal was apparent in pericyte-covered 
capillaries in Pdgfb^ret/+ mice (arrow, top panel). Insets in b′ demonstrate in 
high magnification that Mfsd2a expression is restricted to endothelial cells 
(co-localization with claudin-5 staining) and absent in pericytes (no 
co-localization with Pdgfrβ staining). c, Additional example of the reduction in 
Mfsd2a expression in endothelial cells of Pdgfb^ret/ret mice (longitudinal section). 
d, Quantification of mean fluorescence intensity per vascular profile showed 
significant reduction of Mfsd2a signal in Pdgfb^ret/ret capillaries compared to 
controls. Data are mean ± s.e.m. n = 2 animals per genotype, 60 images 
quantified of at least 600 vascular profiles per animal. 
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a Wild-type Mfsd2a" 


Wild-type 


(arrows) and in the cytoplasm (arrowheads), but not in tight junctions 
(asterisk). Bottom panels (iv-vi), high magnification representative examples of 
Mfsd2a localization on the luminal plasma membrane (arrows), associated with 
luminal invaginating vesicles (iv, v) and with cytoplasmic vesicles 
(arrowheads). All samples are of cortical vessels from adult mice (P30-P90). 
n= 2 for each genotype. L, lumen. 


Extended Data Figure 8 | Immuno-electron-microscopy reveals the 
subcellular localization of Mfsd2a on the plasma membrane and vesicles, 
but not in tight junctions of cerebral endothelial cells. a, Electron 
micrographs showing silver-enhanced immunogold labelling of Mfsd2a in 
cerebral cortex capillaries from wild-type (left) but not in Mfsd2a~/~ mice 
(right), demonstrating staining specificity. b, Top panels (i-iii), three 
representative examples of Mfsd2a localization on the plaama membrane 
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Extended Data Table 1 | Expression-profile comparison of cortex (BBB) and lung (non-BBB) endothelial cells 
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a, Pan-endothelial markers were highly expressed in endothelial cells isolated from both cortex and lung of the endothelial specific Tie2-GFP reporter mouse at E13.5. b, Genes involved in the transport of molecules 
across the BBB, known as adult BBB markers, were highly enriched in brain versus lung endothelial cells. Data are mean of 4 biological replicates (4 litters). 


©2014 Macmillan Publishers Limited. All rights reserved 




Extended Data Table 2 | Vesicular activity and transcytosis events in the brain endothelial cells of Mfsd2a−/− mice are dramatically increased

a  Density of vesicles in the embryo brain endothelium (E17.5)

                                    Mean vesicular density
Tissue      No. of       No. of     Luminal type I   Luminal type II   Cytoplasmic       Abluminal
source      endothelial  vesicles   vesicles (/µm)   vesicles (/µm)    vesicles (/µm²)   vesicles (/µm)
            profiles
Controls    40           1180       0.34 ± 0.05      0.14 ± 0.03       2.04 ± 0.06       0.21 ± 0.03
Mfsd2a−/−   40           2449       0.62 ± 0.03      0.38 ± 0.03       4.62 ± 0.30       0.48 ± 0.04 **

b  Density of HRP-filled vesicles in adult brain endothelium (P90)

Tissue      No. of       No. of HRP-filled   Cytoplasmic HRP+
source      endothelial  vesicles            vesicles (/µm²)
            profiles
Controls    15           0                   0
Mfsd2a−/−   15           97                  3.35 ± 0.55

a, Quantification of the vesicular density (both total and individual type of vesicles) in E17.5 control and mutant endothelium. Mean vesicular density was calculated from the number of vesicular types per µm of
luminal membrane (luminal type I and type II vesicles), per µm² of cytoplasm (cytoplasmic vesicles), and per µm of abluminal membrane (abluminal vesicles). b, Quantification of HRP luminal uptake in P90 HRP-
injected mice. No HRP-filled vesicles were found in wild-type mice. Data are mean ± s.e.m. from 4 controls and 4 mutants (10 vessels per animal, 2 images at ×12,000 per vessel). **P < 0.01, ***P < 0.001, using
Student's t-test.
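
As a rough illustration of how a Student's t statistic can be recovered from tabulated means and standard errors, the sketch below uses the luminal type I densities from panel a. It assumes that the animal is the unit of analysis (n = 4 per group) and that group sizes are equal, in which case the pooled standard error of the difference reduces to the root sum of squares of the two s.e.m. values; the legend does not state the unit of analysis explicitly, and the function name is illustrative.

```python
import math


def t_from_summary(mean_a, sem_a, mean_b, sem_b):
    """Two-sample t statistic from group means and standard errors.

    For equal group sizes, the pooled-variance (Student) standard error of
    the difference equals sqrt(sem_a**2 + sem_b**2).
    """
    return (mean_b - mean_a) / math.sqrt(sem_a ** 2 + sem_b ** 2)


# Luminal type I vesicle density (per µm): controls versus mutants.
t = t_from_summary(0.34, 0.05, 0.62, 0.03)

# Assumption: 4 animals per group are the independent observations.
df = 4 + 4 - 2

print(f"t = {t:.2f}, df = {df}")
```

The statistic would then be compared with the critical value of the t distribution at the chosen significance level for the stated degrees of freedom.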
Structure of the AcrAB-TolC multidrug efflux pump


Dijun Du, Zhao Wang, Nathan R. James, Jarrod E. Voss, Ewa Klimont, Thelma Ohene-Agyei, Henrietta Venter, Wah Chiu
The capacity of numerous bacterial species to tolerate antibiotics
and other toxic compounds arises in part from the activity of 
energy-dependent transporters. In Gram-negative bacteria, many 
of these transporters form multicomponent ‘pumps’ that span 
both inner and outer membranes and are driven energetically by 
a primary or secondary transporter component’. A model system 
for such a pump is the acridine resistance complex of Escherichia 
coli’. This pump assembly comprises the outer-membrane channel 
TolC, the secondary transporter AcrB located in the inner mem- 
brane, and the periplasmic AcrA, which bridges these two integral 
membrane proteins. The AcrAB-TolC efflux pump is able to trans- 
port vectorially a diverse array of compounds with little chemical 
similarity, thus conferring resistance to a broad spectrum of anti- 
biotics. Homologous complexes are found in many Gram-negative 
species, including in animal and plant pathogens. Crystal struc- 
tures are available for the individual components of the pump”” 
and have provided insights into substrate recognition, energy 
coupling and the transduction of conformational changes assoc- 
iated with the transport process. However, how the subunits are 
organized in the pump, their stoichiometry and the details of their 
interactions are not known. Here we present the pseudo-atomic 
structure of a complete multidrug efflux pump in complex with a 
modulatory protein partner® from E. coli. The model defines the 
quaternary organization of the pump, identifies key domain inter- 
actions, and suggests a cooperative process for channel assembly 
and opening. These findings illuminate the basis for drug resist- 
ance in numerous pathogenic bacterial species. 

A small, 49-residue protein, AcrZ, has recently been identified in 
genetic screens as a binding partner of AcrB that affects substrate 
preference®. A mutant strain of E. coli lacking AcrZ is sensitive to some, 
but not all, antibiotics that are exported by the AcrAB-TolC pump. 
AcrZ is conserved in many enterobacterial lineages, suggesting that 
modulation by small proteins may be a recurring theme in the RND 
protein family to which AcrB belongs. 

To investigate their interaction, AcrB was co-expressed with AcrZ 
carrying a carboxy-terminal histidine tag. The partner proteins co-
purified through Ni2+ affinity and size-exclusion chromatography
(Extended Data Fig. 1a), and co-crystals of the AcrBZ complex were 
obtained. Additionally, co-crystals were obtained of a ternary complex 
formed with a designed ankyrin-repeat protein (DARPin)—used in 
previous studies for co-crystallization with AcrB’—that diffracted 
better than co-crystals of the AcrBZ complex alone (Extended Data 
Fig. 1b and Extended Data Table 1). Electron density corresponding to 
AcrZ was apparent in the early, unbiased maps (Extended Data Fig. 2). 
AcrZ folds into a long, predominantly hydrophobic α-helix that fits
into a wide groove in the transmembrane domain of AcrB (Fig. 1a, b
and Extended Data Fig. 2). The AcrBZ complex overlays well with the
previously reported structure of AcrB bound to another potential 
partner, YajC'® (Extended Data Fig. 3a). The helical axis of AcrZ is 
inclined by nearly 45° with respect to the normal vector of the lipid 
bilayer, and although this is an unusual extent of inclination for a 


transmembrane α-helix, it optimizes the interfacial complementarity.
The interfacial residues are well conserved among the identified homo- 
logues of AcrZ, suggesting that analogous interactions are likely to occur 
for other RND family proteins (Extended Data Fig. 3b, c). We were able 
to model AcrZ residues 1-46, revealing that the amino terminus is in the 
periplasm and the C terminus is in the cytoplasm, where it interacts with 
the hydrophilic surface of AcrB, near the membrane (Fig. 1b). It seems 
possible that the reported effects of AcrZ on drug sensitivity® might 
originate from the allosteric modulation of AcrB activity by AcrZ. 
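
The helix inclination discussed above is the angle between the helical axis and the bilayer normal. The following minimal sketch shows that calculation; the endpoint coordinates are hypothetical (not taken from the deposited structure), the bilayer normal is assumed to lie along z, and the function name is illustrative.

```python
import math


def tilt_angle_deg(start, end, normal=(0.0, 0.0, 1.0)):
    """Angle in degrees between the axis start->end and the membrane normal.

    The absolute value of the dot product is used so that the result does
    not depend on which end of the helix is taken as the start.
    """
    axis = [e - s for s, e in zip(start, end)]
    dot = sum(a * n for a, n in zip(axis, normal))
    len_axis = math.sqrt(sum(a * a for a in axis))
    len_normal = math.sqrt(sum(n * n for n in normal))
    cos_theta = min(1.0, abs(dot) / (len_axis * len_normal))
    return math.degrees(math.acos(cos_theta))


# Hypothetical helix-axis endpoints (in Å): equal displacement in the
# membrane plane and along the normal.
angle = tilt_angle_deg((0.0, 0.0, 0.0), (30.0, 0.0, 30.0))
print(f"tilt = {angle:.1f} degrees")
```

In practice the axis endpoints would be estimated from the backbone atoms of the modelled helix, for example as the centroids of the first and last helical turns.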

To stabilize the complete pump assembly, constructs were prepared 
in which the components were fused (Extended Data Fig. 4a, b). On the 
basis of the crystal structure of the AcrBZ complex, a construct was 
designed in which AcrA is fused through a flexible linker to the N 
terminus of AcrZ, which was predicted to leave the individual structures 
unperturbed and to permit localization of AcrA to the periplasm while 
allowing AcrZ to cross the inner membrane with the correct topology 
and preserving its interactions with AcrB. This fusion harbours a histi-
dine tag on the C terminus of AcrZ to facilitate Ni2+ affinity purification
of the fusion protein. In a second engineered construct, we identified a
loop in the periplasmic domain of AcrB where AcrA could be inserted
with flexible linkers. Both the AcrA-AcrZ-His5 and AcrA-AcrB fusions
were stable during overexpression and purification.

At present, there is debate about the compositional stoichiometry of 
the AcrAB-TolC pump, with evidence favouring a 3:6:3 ratio for AcrB:
AcrA:TolC. To enable the assembly of such a complex, we explored
the co-expression of the fusion pairs AcrA-AcrB and AcrA-AcrZ-
His5. In the course of isolating the co-expressed recombinant proteins,
we observed that the AcrA-AcrZ-His5 fusion co-purified with both
the AcrA-AcrB fusion and endogenous TolC. We subsequently designed
co-expression vectors comprising fusion proteins of AcrA-AcrB, AcrA-
AcrZ-His5 and TolC, and found that the proteins remained associated
during purification (Extended Data Fig. 4c). Assays of minimum inhib- 
itory concentration (MIC) and drug efflux using the AcrAB and AcrAZ 
fusion proteins show that the engineered pump retains partial activity 
(Extended Data Fig. 5). 

The assembly was further purified by the GraFix method”, and 
analysis of the fractions by negative staining electron microscopy read- 
ily led to the identification of drumstick-shaped particles that matched 
the expected size and shape of the pump (Extended Data Fig. 6a). The 
crosslinked particles were subsequently imaged using cryo-electron 
microscopy (cryo-EM), which shows images of isolated and aggre- 
gated particles (Fig. 2a). Initial three-dimensional reconstruction 
was carried out using EMAN2 (ref. 18) without any imposed sym- 
metry (Extended Data Fig. 6b), which suggested the presence of at least 
three-fold symmetry. As TolC and AcrB are known to be homotri- 
mers, a subsequent three-fold symmetry enforced map was indepen- 
dently produced (Extended Data Fig. 6c). We then used RELION 
software’ to refine the particle classification and orientation deter- 
mination, which improved the map greatly. The resolution of the final 
map was estimated to be ~16 Å (Extended Data Fig. 7a). This map was
validated by a tilt-pair analysis (Extended Data Fig. 7b), which is
considered to be best practice for assessing the correctness of a low-
resolution cryo-EM map.
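
Imposing three-fold symmetry, as described above for the reconstruction, amounts to treating every feature as repeated at 120° intervals about the symmetry axis. The sketch below illustrates only that geometric idea for a single point; it is not the procedure implemented in EMAN2 or RELION, the symmetry axis is assumed to coincide with z, and the coordinates and function names are hypothetical.

```python
import math


def rotate_about_z(point, angle_deg):
    """Rotate an (x, y, z) point about the z axis by angle_deg degrees."""
    x, y, z = point
    a = math.radians(angle_deg)
    return (
        x * math.cos(a) - y * math.sin(a),
        x * math.sin(a) + y * math.cos(a),
        z,
    )


def c3_copies(point):
    """Return the three positions related by a three-fold axis along z."""
    return [rotate_about_z(point, k * 120.0) for k in range(3)]


# Hypothetical position (in Å) of one feature in one protomer.
copies = c3_copies((10.0, 0.0, 5.0))
for x, y, z in copies:
    print(f"{x:8.3f} {y:8.3f} {z:8.3f}")
```

The same operation applied to every atom of one protomer generates a symmetric trimer, which is also how a crystallographic three-fold axis relates the protomers of a homotrimer.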

The quality of the final cryo-EM map was sufficient to permit dock- 
ing of the crystal structures of TolC, AcrA and the AcrBZ complex with 
only small movements in the flexible linker regions between the domains 
of AcrA (Fig. 2b and Methods). After sequential fitting of AcrA, AcrBZ 
and TolC into the cryo-EM map, the final model has good agreement 
with the density envelope (Fig. 2b, c and Extended Data Fig. 8a). The 
transmembrane portion of TolC is enclosed in a torus that resembles 
the detergent shell observed for aquaporin in the same detergent used 
in this study, namely n-dodecyl-β-D-maltopyranoside (DDM). The
detergent also forms a layer around the transmembrane portion of AcrB 
(Fig. 2b, c). 

One salient observation of our model is that there is no direct inter- 
action between TolC and AcrB in the assembly, in contradiction to 
earlier models’. Instead, the two proteins are bridged in the periplasm 
entirely through AcrA. This is consistent with in vivo crosslinking and 
pull-down assays, which indicated that AcrA interacts with AcrB and 
TolC independently, whereas no interaction was observed between 
AcrB and TolC”’. Our model also accounts for thermodynamic mea- 
surements, which suggest that the pairwise interactions of AcrA-AcrB 
and AcrA-TolC occur spontaneously, whereas that between AcrB and 
TolC does not*’. The organization of the AcrA-TolC subassembly is in 
agreement with the dimensions of the homologous MexA-OprM com- 
plex from Pseudomonas aeruginosa**”>. 

Our model shows that the assembly comprises an AcrB trimer, an
AcrA hexamer and a TolC trimer. This is in accord with the 3:6 stoichi-
ometry of the homologous CusA-CusB heavy-metal efflux complex.
The hexameric assembly of AcrA resembles that of the homologous
MacA (Fig. 3a, b). The α-helical coiled coils (or ‘hairpins’) of AcrA
pack into a cylinder that interacts with the periplasmic ends of the
α-helical coiled-coil portion of TolC. As TolC has an internal structural
repeat, there are six quasi-equivalent contact surfaces that interact with
the AcrA hairpins (Fig. 3a and Extended Data Fig. 8b).


Figure 1 | The crystal structure of the inner-
membrane protein AcrB in complex with AcrZ.
a, Overall view of the AcrBZ trimer. The
subdomains of an AcrB protomer are labelled PN1,
PN2, PC1, PC2, DN and DC; the transmembrane
helices are labelled TM1-6 and TM7-12; and the
linking helix is labelled Iα. AcrB contains a
structural repeat and the structural units comprise
PN1-PN2-DN-TM1-6 and PC1-PC2-DC-
TM7-12. The protein was crystallized with
DARPin, which is engaged to the AcrB protomer in
the crystallographic asymmetric unit (not shown
for clarity). b, Electron density map with
coefficients 2Fo − Fc overlaid on the model for
AcrZ (green sticks); the contour level is 1σ. AcrB is
shown in surface representation, with residues
coloured according to their relative hydrophobicity
(hydrophobic in red, hydrophilic in white).

The membrane-proximal domain and β-barrel domain of AcrA are
involved in defined interactions with AcrB (Fig. 3a). One protomer of 
AcrA bridges the upper regions of subdomains PC1, PC2 and DC of 
AcrB (protomer 1 in Fig. 3a); this interaction is similar to that between 
CusA and CusB (Fig. 3c). However, the adjacent protomer of AcrA 
(protomer 2 in Fig. 3a) interacts with AcrB in a different manner: here, 
the membrane-proximal domain of AcrA shifts towards the PN2 sub- 
domain of AcrB so that only the upper regions of PN2 and DN contact 
AcrA. The lipoyl domains of AcrA principally interact with each other 
to form a channel that is closed to the periplasm and make no pre- 
dicted interactions with either AcrB or TolC. 

The side-by-side packing of the β-barrel, lipoyl and α-helical hair-
pin domains of the six AcrA protomers generates a funnel-like struc-
ture with a sealed central channel along the three-fold axis. A chamber
is formed by the β-barrel and lipoyl domains of AcrA, and the bottom
of the chamber opens into the funnel of AcrB. The chamber is partially 
constricted near the middle of the lipoyl domain (Fig. 2c and Extended 
Data Fig. 8a). It then continues to a channel created by the helical 
hairpin domains, with a cogwheel shape at the end of these domains, 
like that of MacA (Fig. 3a, b). The modelled docking of the membrane- 
proximal domain of AcrA onto the surface of AcrB is consistent with 
the disruptive behaviour of mutants at that interface**’’. 

A cross-section through the reconstruction reveals a conduit that is
open to the cell exterior and runs from the transmembrane β-barrel of
TolC, through the TolC periplasmic domain, and continues all the way
to the funnel-shaped canyon in the periplasmic domain of AcrB
(Fig. 2c). The cross-section indicates that the channel is sealed off from
the periplasm and cytoplasm, and the components in the pseudo-
atomic model shown in Fig. 2b pack efficiently to leave no gaps. The
conduit is the likely exit pathway for efflux of substrates, which will be
delivered into this chamber through successive opening and closing of
ligand-binding pockets in AcrB, adjacent to the apex of the funnel-
shaped periplasmic canyon.


Figure 2 | Cryo-EM images and a pseudo-
atomic model of the drug efflux pump. a, A
representative raw image of the purified pump.
White circles indicate particles with long axis
almost normal to the viewing plane; black circles
show particles with the long axis parallel to the
viewing plane. Scale bar, 50 nm. b, The
reconstructed map and pseudo-atomic model.
TolC and AcrBZ are homotrimers that engage six
protomers of AcrA to form an assembly with a
protein mass of 771 kDa. AcrA has a composite
structure of four linearly discontinuous domains
connected by flexible linkers (see also Fig. 3a). c, A
slice through the reconstruction and model, which
shows the continuous conduit that runs from the
AcrB funnel through the TolC porin domain. IM,
inner membrane; OM, outer membrane.

In isolation, TolC assumes a ‘closed’ resting state in which its peri- 
plasmic domain tapers to a near close, but to be accommodated in the 
continuous channel seen in the cryo-EM model, TolC must switch to 
an ‘open’ state**” (Fig. 2c and Extended Data Fig. 8b). The opening of 
TolC must result from its direct interactions with the α-helical hairpins
of AcrA. In this scenario, AcrB provides a platform for the proper 
assembly of hexameric AcrA suitable for presentation to TolC and 
opening of the channel. The AcrAB-TolC assembly forms in vivo 
without the requirement for substrates or proton-motive force”, sug- 
gesting that exogenous energy may not be required to open TolC in the 
context of the assembled pump. It seems likely that TolC remains open 
throughout the transport cycle, and we propose that the dilation of 
TolC from the closed to the open state is driven by both chelate 
cooperativity (due to the hexameric organization of AcrA) and allos- 
teric cooperativity (associated with the breaking of interprotomer 
interactions in TolC). 

The structure of the efflux pump presented here is likely to be similar 
to homologous assemblies in pathogenic species that infect humans 
such as Vibrio cholerae, P. aeruginosa, Neisseria gonorrhoeae and Salmonella 
enterica. The question arises as to how it might be feasible to counter the 
actions of drug efflux in the treatment of bacterial infections. It seems 
paradoxical that a compound might exist that would achieve this, when 
the pump is well crafted to handle diverse compounds. Perhaps the 
protein-protein contacts that mediate the assembly might be favour- 
able targets for such approaches. 


METHODS SUMMARY


Recombinant proteins were engineered with histidine tags, overexpressed in E. coli
and purified chromatographically. The AcrBZ and AcrBZ-DARPin complexes were
crystallized by vapour diffusion and diffraction data were collected at the Diamond
Light Source synchrotron facility. The crystal structure was solved using molecular
replacement and refined. Freshly prepared specimens of the AcrABZ-TolC complex
were applied to glycerol-glutaraldehyde gradients with ultracentrifugation. The
fractions were analysed by electron microscopy after either negative staining or
rapid freezing under hydrated conditions. Individual particles were chosen from
the cryo-EM images and were used for two-dimensional projection class averages
and three-dimensional image reconstruction. For in vivo activity tests of the
engineered AcrABZ-TolC pump, expression vectors for the fusion proteins and
controls were transformed into deletion strains. MICs were evaluated in the pres-
ence of antibiotics and efflux assays were conducted on fluorescent substrates.


Figure 3 | Modelled interactions of the efflux
pump components. a, Interactions of the AcrA
hairpin domain with TolC and the AcrA β-barrel
and membrane-proximal domains with AcrB. The
lipoyl domains principally interact with each other
and make no interactions with AcrB or TolC. Two
AcrA protomers in the homohexameric ring are
shown in colour for the hairpin, lipoyl, β-barrel and
membrane-proximal domains to illustrate the
domain-domain interactions. Subdomains of one
protomer of the AcrB trimer are labelled with white
labels. The membrane-proximal domains of the
two contiguous AcrA protomers make different
contacts with the same protomer of AcrB.
b, Homohexameric subunit organization in the
structure of MacA, a homologue of AcrA from
Actinobacillus actinomycetemcomitans (Protein
Data Bank (PDB) accession 4DK0). c, Interactions
between CusA and CusB in the metal-efflux pump
from E. coli.


Online Content Any additional Methods, Extended Data display items and Source
Data are available in the online version of the paper; references unique to these
sections appear only in the online paper.
METHODS


Construction of vectors for overexpression of AcrBZ and AcrABZ-TolC complex.
The acrZ gene was generated by PCR from genomic DNA of E. coli W3110 strain
using primers AcrZNcoI_F: 5'-TGGCCATGGGCTTAGAGTTATTAAAAAGT
CTGGTATTCGCCGTAATCATGG-3' and AcrZHis5SalI_R: 5'-GACGTCGACT
CAGTGGTGGTGGTGGTGATGATTTTGTCCGGGCTGGTCTTTTTTACC-3'.
The pRSFDuet-1-acrZHis5 vector was constructed by subcloning the fragment of
acrZ-His5, bounded by NcoI and SalI sites, into the multiple cloning site (MCS) of
expression vector pRSFDuet-1.
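
A quick sanity check on cloning primers is to confirm that each carries the intended restriction site. The sketch below searches the two acrZ primers, as printed above, for the NcoI (CCATGG) and SalI (GTCGAC) recognition sequences; the helper name `find_sites` is illustrative and not part of any published tool.

```python
NCOI = "CCATGG"   # NcoI recognition sequence
SALI = "GTCGAC"   # SalI recognition sequence


def find_sites(seq, site):
    """Return the 0-based start positions of every occurrence of site in seq."""
    seq = seq.upper()
    width = len(site)
    return [i for i in range(len(seq) - width + 1) if seq[i:i + width] == site]


# Primer sequences copied from the text, with line breaks removed.
forward = "TGGCCATGGGCTTAGAGTTATTAAAAAGTCTGGTATTCGCCGTAATCATGG"
reverse = "GACGTCGACTCAGTGGTGGTGGTGGTGATGATTTTGTCCGGGCTGGTCTTTTTTACC"

ncoi_hits = find_sites(forward, NCOI)
sali_hits = find_sites(reverse, SALI)
print("NcoI in forward primer at:", ncoi_hits)
print("SalI in reverse primer at:", sali_hits)
```

In both primers the site follows a short 5' overhang of three nucleotides, a common design choice because many restriction enzymes cut poorly at the very end of a DNA fragment.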

The pET21a-acrByyiss vector was constructed by subcloning the fragment of acrB, 
bounded by NdeI and XhoI sites, into the MCS of expression vector pET21a. The
vector pET21a-acrBΔHis was constructed by site-directed mutagenesis using pET21a-
acrBrys5 as a template and primers AcrB_CF: 5'-TGAAGATATCGAGCACAGC
CATACTGTCGATTGAGATCCGGCTGCTAACAAAGCCC-3’ and AcrB_CR: 
5'-GGGCTTTGTTAGCAGCCGGATCTCAATCGACAGTATGGCTGTGCTC 
GATATCTTCA-3’. 
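
Primer pairs used for site-directed mutagenesis of this kind are normally fully complementary to each other. The sketch below checks that property for AcrB_CF and AcrB_CR using the sequences printed above; the function name is illustrative.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


# Primer sequences copied from the text, with line breaks removed.
acrb_cf = "TGAAGATATCGAGCACAGCCATACTGTCGATTGAGATCCGGCTGCTAACAAAGCCC"
acrb_cr = "GGGCTTTGTTAGCAGCCGGATCTCAATCGACAGTATGGCTGTGCTCGATATCTTCA"

is_complementary_pair = reverse_complement(acrb_cf) == acrb_cr
print("AcrB_CF and AcrB_CR are reverse complements:", is_complementary_pair)
```

The same check is a convenient way to catch transcription errors when primer sequences are copied from a manuscript into an order form.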

The acrA gene was amplified using primers AcrANcoI_F: 5'-TGGCCATGG
GCAACAAAAACAGAGGGTTTACGCCTCTGGCG-3' and AcrAGSx3_R: 5'-T
GGCCATGGATCCGCCGCCACCAGAGCCACCACCGCCGCTCCCACCGCCA
CCAGACTTGGACTGTTCAGGCTGAGCACC-3', to add a poly-GlySer linker to
the C terminus and an NcoI site at both ends. The NcoI-bounded acrA-polyGlySer
was subcloned into the NcoI site of pRSFDuet-1-acrZHis5, resulting in the construct
pRSFDuet-1-acrA-polyGlySer-acrZHis5. The tolC gene was amplified using primers
TolCinf_F: 5'-AAGGAGATATACATATGAAGAAATTGCTCCCCATTCTTA
TCGGCC-3' and TolC1392inf_R: 5'-TTGAGATCTGCCATATGTCAATCAG
CAATAGCATTCTGTTCCGGCGT-3'. The PCR product was then inserted into
the NdeI site of pRSFDuet-1-acrA-polyGlySer-acrZHis5 using the In-Fusion clon-
ing method (Clontech), generating the construct pRSFDuet-1-acrA-polyGlySer-
acrZHis5-tolC1392.

A BamHI site was inserted into acrB by site-directed mutagenesis using
pET21a-acrBΔHis as a template and primers AcrB_D328F: 5'-CTGAAAATTGTT
TACCCATAGACGGATCCACCACGCCGTTCGTGAAA-3' and AcrB_D328R:
5'-TTTCACGAACGGCGTGGTGGATCCGTCGTATGGGTAAACAATTTTCA
G-3', resulting in the construct pET21a-acrBΔHisBamHI328. The acrA gene was
amplified using primers AcrA_25GSF: 5'-GGATCCGGTGGGAGCGGTGGCGG
CGGTAGTGGCGGTGGTGGCTCTTGTGACGACAAACAGGCCCAACAAGG-3'
and AcrAGSx3_R2: 5'-GGATCCGCCGCCACCAGAGCCACCACCGCCGCTCCCA
CCGCCACCAGACTTGGACTGTTCAGGCTGAGCACC-3', to add a poly-GlySer
linker to both ends of acrA. The resulting DNA fragment was amplified again using
primers AcrA Infusion_Forward: 5'-CCCATACGACGGATCCGGTGGGAGCG
GTGGCGGCGGTA-3' and AcrA Infusion_Reverse: 5'-ACGGCGTGGTGGAT
CCGCCGCCACCAGAGCCACCACC-3'. The DNA fragment was then inserted
into the BamHI site of pET21a-acrBΔHisBamHI328 using the In-Fusion cloning method,
generating the construct pET21a-acrB328-polyGlySer-acrA-polyGlySer-acrB329.
Overexpression and purification of the AcrBZ and the AcrBZ-DARPin com-
plexes. The constructs pET21a-acrBΔHis and pRSFDuet-1-acrZHis5 were trans-
formed into E. coli strain C43(DE3) ΔacrAB. The C-terminal histidines of AcrB
were substituted to prevent nonspecific association with the metal affinity matrix.
Cells were grown in 2×YT medium with 100 µg ml−1 carbenicillin and 50 µg ml−1
kanamycin at 37 °C until the culture reached an absorbance, at 600 nm, of 0.5-0.6
and was then induced by the addition of 0.5 mM isopropyl 1-thio-β-D-galactopyr-
anoside (IPTG) at 18 °C overnight. Cell pellets were resuspended in lysis buffer
(400 mM sodium chloride, 20 mM Tris-HCl, pH 8.0) with 1 tablet per 50 ml EDTA-
free protease inhibitor cocktail tablets, 5 U ml−1 DNase I and 5 mg ml−1 lysozyme,
and the mixture was stirred at 4 °C for 1 h to digest the cell wall. The cells were lysed
using a homogenizer (EmulsiFlex) at 15,000 psi for eight passages. Cell debris was
pelleted by centrifugation at 9,000g for 30 min. Cellular membrane was pelleted by
ultracentrifugation at 125,755g for 3 h.

Membrane pellets were resuspended in lysis buffer with protease inhibitors and 
were solubilized by adding 1.5% DDM and stirring at 4°C for 3h. Debris was 
pelleted by ultracentrifugation at 125,755g for 30 min. Imidazole was added to the 
membrane solution to a final concentration of 10 mM. Histidine-tagged AcrBZ 
complex was purified by nickel affinity chromatography using a HiTrap chelating 
column (GE Healthcare Life Sciences) equilibrated with GF buffer 1 (400 mM 
sodium chloride, 20 mM Tris-HCl, pH 8.0, 0.05% DDM) containing 20 mM imi- 
dazole. The column was washed with 50 mM and 75 mM imidazole added to GF 
buffer 1, respectively. Purified AcrBZ complex was eluted with 500 mM imidazole 
in GF buffer 1, concentrated and loaded onto a Superose 6 column equilibrated 
with GF buffer 1. Fractions containing purified AcrBZ complex were pooled and 
concentrated to 15-20 mg ml−1 using a Vivaspin concentrator (MWCO = 100 kDa)
and dialysed overnight against sample buffer (10 mM HEPES pH 7.5, 50 mM sodium 
chloride, 0.03% DDM) using a 100 kDa MWCO dialysis membrane to decrease the 
detergent concentration. 


The DARPin gene was synthesized and protein was overexpressed and purified
as described. Purified DARPin and AcrBZ complex were mixed at a molar ratio
of 1:2 (AcrBZ monomer:DARPin monomer). The mixture was diluted with GF
buffer 2 (20 mM Tris-HCl, pH 7.5, 150 mM sodium chloride, 0.03% DDM) to a
concentration of 2-3 mg ml−1, incubated at 4 °C overnight, then concentrated to
0.5 ml using a Vivaspin concentrator (MWCO = 100 kDa) and loaded onto a Superose
6 column equilibrated with GF buffer 2. Fractions containing purified AcrBZ-DARPin
complex were pooled and concentrated to 15-20 mg ml−1 using a Vivaspin con-
centrator (MWCO = 100 kDa) and dialysed overnight against sample buffer using
a 100 kDa MWCO dialysis membrane; the final concentration was 10-15 mg ml−1.
Crystallization of AcrBZ and AcrBZ-DARPin. The AcrBZ and AcrBZ-DARPin
complexes were diluted to 10 mg ml−1 using sample buffer. 9 mM n-octyl-β-D-
thioglucopyranoside (90 mM) was mixed with the AcrBZ complex before the crys-
tallization trials. The AcrBZ crystals were grown at 20 °C using the hanging-droplet
vapour diffusion method by mixing 4 µl of AcrBZ complex with 2 µl of reservoir
solution (100 mM tricine, pH 7.4, 50 mM lithium sulphate, 5 mM cadmium chlor-
ide hydrate, 7% PEG 3000, 10% glycerol). The AcrBZ-DARPin complex was incu-
bated with puromycin at 1 mM for 3 h at 4 °C before crystallization trials. Crystals
were grown at 20 °C using the sitting-droplet vapour diffusion method by mixing
400 nl of AcrBZ-DARPin complex with 200 nl of reservoir solution (100 mM
HEPES, pH 7.5, 10 mM MgCl2 and 12% (w/v) PEG 3350). Crystals appeared 24 h
after setting up the crystallization trials and reached maximal size in 1 week. The
crystals were transferred briefly into reservoir solution supplemented with 25%
glycerol as cryoprotectant before flash freezing in liquid nitrogen.
Crystallographic data collection and structure refinement. Data sets were col-
lected using beamlines I04-1 and I24 at the Diamond Light Source. The diffraction
data were processed using X-ray Detector Software (XDS) and scaled using
SCALA. Structures were solved by molecular replacement using Phaser with chain
A of AcrB (PDB accession 4DX5) as a search model and refined using Crystallography
and NMR System (CNS), PHENIX and REFMAC5 (ref. 37) with reference struc-
ture restraints. Coot was used for modelling. Data collection and refinement
parameters are presented in Extended Data Table 1.

Both the AcrBZ and AcrBZ-DARPin complexes crystallized in space group 
R32. The asymmetric unit is occupied by one protomer of AcrB and one protomer 
of AcrZ. Maps calculated from molecular replacement using only the AcrB pro- 
tomer revealed clear electron density for AcrZ in both complexes (see Extended 
Data Fig. 2). For the AcrBZ-DARPin complex, unbiased density was also present
that could fit the DARPin model. Density was also apparent that matched the
modelled detergent and acyl chains in the 1.9 Å resolution structure of E. coli AcrB
(PDB accession 4DX5). These were included in the model, as well as additional 
acyl chains that were located mostly in the intra-protomer space in the transmem- 
brane region of AcrB. For both AcrBZ and AcrBZ-DARPin complexes, the AcrB 
homotrimer is formed through crystallographic symmetry, so that any potential 
non-equivalence of the protomers is consequently lost. 

Expression and purification of the efflux pump assembly. The constructs
pET21a-acrB328-polyGlySer-acrA-polyGlySer-acrB329 and pRSFDuet-1-acrA-polyGlySer-
acrZHis5-tolC1392 were transformed into E. coli strain C43(DE3) ΔacrAB. The pro-
cedure for overexpression and purification of AcrABZ-TolC is similar to that of
AcrBZ, with minor modifications, as follows. Cells were induced with 0.15 mM
IPTG at 20 °C overnight, harvested and resuspended in lysis buffer 2 (20 mM Tris-
HCl, pH 7.5, 400 mM NaCl) for membrane preparation. Cellular membrane was
solubilized with 1.5% DDM in lysis buffer 2. The AcrABZ-TolC complex purified
by nickel affinity chromatography was loaded onto a Superose 6 column equili-
brated with GF buffer 3 (50 mM HEPES pH 7.5, 400 mM NaCl, 0.03% DDM).
Fractions containing purified AcrABZ-TolC complex were pooled and concen-
trated to 1 mg ml−1 using a Vivaspin column (MWCO = 100 kDa).

Transport assays. Transport activity was measured as described with some
modifications. Plasmids were propagated in the triple knockout E. coli strain
MCΔtolCΔacrAB, a derivative of E. coli MC1061 converted to a (DE3) lysogen.
All experiments used basal levels of protein expression without induction with
IPTG. Cells were grown in LB-Broth Miller medium (Formedium) containing the
appropriate antibiotic(s) with shaking at 37 °C until an OD660 nm of 0.5 was
reached. The cells were harvested by centrifugation at 3,000g for 10 min at 4 °C,
then washed three times by resuspension in 50 mM potassium phosphate buffer,
pH 7.0, containing 5 mM MgSO4 and sedimented by centrifugation at 3,000g for
10 min at 4 °C. The cells were then resuspended in the same buffer to an OD660 nm of
0.5 and incubated for 3 min at room temperature (25 °C) in the presence of 25 mM
glucose to energize the cells. The fluorescence measurement was started, and 60 s
later 2 µM ethidium bromide or 0.25 µM trimethylammonium-diphenylhexatriene
(TMA-DPH) was added. The fluorescence was followed as a function of time in a
PerkinElmer LS 55B fluorimeter. Excitation and emission wavelengths and excita-
tion and emission slit widths were 500, 580, 5 and 10 nm respectively for ethidium
bromide; and 350, 425, 5 and 5 nm respectively for TMA-DPH. Initial substrate
transport rates were determined between 85 and 185 s and between 75 and 95 s for
ethidium bromide and TMA-DPH, respectively. Assays were performed twice per day for
three separate days, with different cell preparations used each day.
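
One plausible way to extract an initial transport rate from such a trace is a least-squares line fitted over the stated time window. The sketch below does this for the 85-185 s window used for ethidium bromide. The trace is synthetic and purely illustrative, and the fitting method is an assumption, as the text does not specify how the rates were computed.

```python
def initial_rate(times, signal, t_start, t_end):
    """Least-squares slope of signal versus time within [t_start, t_end]."""
    window = [(t, y) for t, y in zip(times, signal) if t_start <= t <= t_end]
    if len(window) < 2:
        raise ValueError("need at least two points inside the window")
    n = len(window)
    mean_t = sum(t for t, _ in window) / n
    mean_y = sum(y for _, y in window) / n
    sxx = sum((t - mean_t) ** 2 for t, _ in window)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in window)
    return sxy / sxx


# Synthetic (hypothetical) trace: flat baseline, substrate added at 60 s,
# then a linear change of 0.8 arbitrary units per second.
times = list(range(0, 201, 5))
signal = [100.0 if t < 60 else 100.0 + 0.8 * (t - 60) for t in times]

rate = initial_rate(times, signal, 85, 185)
print(f"initial rate = {rate:.3f} a.u. per s")
```

Rates from replicate traces can then be averaged across the cell preparations measured on separate days.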

GraFix preparation of the efflux pump assembly. The GraFix procedure for the 
AcrABZ-TolC complex was carried out as described’””’. Gradients for GraFix 
were generated by mixing GraFix buffer 1 (50 mM HEPES, pH 7.5, 400 mM NaCl, 
0.03% w/v DDM, 10% v/v glycerol) and GraFix buffer 2 (50 mM HEPES, pH 7.5, 
400 mM NaCl, 0.03% w/v DDM, 30% v/v glycerol, 0.15% v/v glutaraldehyde) 
using a gradient mixer (Gradient Master 107, BioComp Instruments), following 
the manufacturer's recommendations for determining the parameters. 100 pmol 
of AcrABZ-TolC complex was loaded onto the top of the buffering cushion. 
Ultracentrifugation was carried out at 111,845g and 4 °C for 18 h (SW60 rotor,
Beckman). After centrifugation, the gradients were fractionated at 4 °C by using
a capillary to pump the gradient out from bottom to top, taking fractions of 200 µl.
Equivalent fractions from different centrifuge tubes were pooled. Glycine
(1 M, pH 7.5) was added to the fractions to a final concentration of 80 mM to
quench further crosslinking, and the mixtures were concentrated to 200 µl using a
Vivaspin concentrator (MWCO = 100 kDa). The concentrated protein samples
were diluted to 1 ml using GF buffer 3, and were concentrated again to 100 µl to
decrease the concentration of glycerol. The AcrABZ-TolC samples were flash frozen
in liquid nitrogen and stored at −80 °C.
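
The quench and buffer-exchange steps above are simple dilution arithmetic. The sketch below works through them under stated assumptions: the 20% starting glycerol is an illustrative mid-gradient value (the actual value depends on the fraction), glycerol is assumed to pass freely through the concentrator membrane, and both function names are hypothetical.

```python
def stock_volume_for_final(final_mM, stock_mM, sample_volume_ul):
    """Stock volume to add so the mixture reaches final_mM.

    Solves stock_mM * v = final_mM * (sample_volume_ul + v) for v.
    """
    return final_mM * sample_volume_ul / (stock_mM - final_mM)


def glycerol_after_dilution(start_pct, concentrate_ul, diluted_ul):
    """Glycerol % after diluting a concentrate with glycerol-free buffer."""
    return start_pct * concentrate_ul / diluted_ul


# 1 M glycine stock added to a 200 µl fraction to reach 80 mM.
glycine_ul = stock_volume_for_final(80, 1000, 200)

# 200 µl concentrate diluted to 1 ml with GF buffer 3 (no glycerol);
# re-concentrating afterwards does not change the glycerol percentage.
glycerol_pct = glycerol_after_dilution(20.0, 200, 1000)
```

Under these assumptions about 17 µl of glycine stock is needed per 200 µl fraction, and one dilution cycle lowers glycerol five-fold.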

Negative-stain electron microscopy. Continuous carbon electron microscopy 
grids (Agar Scientific) were glow discharged. Sample adsorption was achieved by 
placing the grid carbon-side down on a 10 µl drop of protein solution at a concen-
tration of approximately 10 µg ml⁻¹. Adsorption times were determined empir-
ically. After blotting to dryness, the grid was washed by placing it onto a drop of
deionized H2O for 5 s and blotted dry, and this wash was then repeated. The protein was stained by
transferring the grid to a drop of 2% w/v uranyl acetate for 30 s, then dried and stored
at room temperature. The sample was then imaged at ×20,000 magnification and
120 keV in a FEI Tecnai G2 microscope at the Multi Imaging Centre, University of
Cambridge. 

Cryo-EM specimen preparation and data collection. Samples were embedded 
in vitreous ice as per the following method. A 2.0 µl aliquot at approximately
2 mg ml⁻¹ was applied onto holey carbon film supported by a 200-mesh R1.2/1.3
Quantifoil grid (Quantifoil) that had been previously washed and glow discharged.
The grid was blotted and rapidly frozen in liquid ethane using a Vitrobot IV (FEI)
with constant temperature and humidity during the process of blotting, and the
grid was stored in liquid nitrogen before imaging. All grids were screened on a
JEM3200FSC cryo-electron microscope (JEOL) operated at 300 kV, spot size 2,
condenser aperture 70 µm, objective aperture 60 µm, and energy slit of the in-column
filter of 20 eV. Images were recorded with a direct detection device (DDD) (DE-12
3k × 4k camera, Direct Electron, LP) operating in movie mode at a recording rate of 25
raw frames per second. A magnification of ×20,000 (corresponding to a calibrated
sampling of 2.47 Å per pixel) and a dose rate of 25 electrons s⁻¹ Å⁻² were used to
acquire data for each specimen area with a total exposure time of 2 s. A total of 1,281
DDD images using movie frame mode were collected with a defocus range of 2–4 µm.
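
The acquisition parameters above fix several derived quantities. The short calculation below is only a worked illustration of that arithmetic; the variable names are arbitrary, and the 256-pixel box size is taken from the data-processing paragraph.

```python
pixel_size_A = 2.47        # calibrated sampling, Å per pixel
dose_rate = 25.0           # electrons per second per Å^2
exposure_s = 2.0           # total exposure time, s
frame_rate = 25            # raw frames per second
box_px = 256               # particle box edge, pixels

total_dose = dose_rate * exposure_s              # electrons per Å^2 per exposure
dose_per_frame = dose_rate / frame_rate          # electrons per Å^2 per raw frame
frames_recorded = int(frame_rate * exposure_s)   # raw frames per exposure
nyquist_A = 2 * pixel_size_A                     # finest resolvable spacing, Å
box_A = box_px * pixel_size_A                    # physical box edge, Å
```

This gives 50 electrons Å⁻² over 50 raw frames (about one electron Å⁻² per frame), a Nyquist limit of 4.94 Å and a box edge of roughly 632 Å.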

Cryo-EM data processing and resolution evaluation. Particle images were
manually boxed with the EMAN2 program e2boxer.py’* with a box size of 256 X 
256 pixels using the averaged sum of 48 raw frames per specimen area. The final 
frame average was computed from averages of every five consecutive frames to 
correct for the specimen movement during exposure similar to the procedure 
previously described’’. The particle intensity in each frame was weighted according 
to a radiation damage model (courtesy of B. Bammes). Defocus of the particles in 
each frame average was determined by EMAN2 program e2ctf.py™. The first recon- 
struction was done without any symmetry imposition or any initial model, using 
EMAN2 (ref. 18). We started with 12,000 particle images and 8,400 particle images 
were selected manually according to the similarity in appearance to the class averages. 
The first map without any symmetry imposition appeared to have a six-fold sym- 
metry (Extended Data Fig. 6b), which justified applying a three-fold symmetry 
enforcement (Extended Data Fig. 6c). The symmetry-free and symmetry-imposed 
maps look similar except for a slight difference in contrast. Then, we used RELION”” 
to perform two additional steps of particle orientation refinement of the 8,400 
selected particle images. Using the EMAN2-generated map (Extended Data Fig. 6c), 
low-pass filtered to 60 Å, we performed a particle orientation refinement by RELION
with a search angle of 7.5 degrees for 14 iterations and subsequently with an angular
sampling of 1.8 degrees for 11 iterations. The resolution of the resulting map is 21 Å
based on the 0.143 FSC criterion (Extended Data Fig. 7a). Next, this map was sharpened’*
and low-pass filtered to 30 Å before performing a second round of particle orienta-
tion refinement by RELION with an angular sampling of 1.8 degrees for 8 iterations.
The resolution of the final map is estimated to be 16 Å by the 0.143 FSC criterion
(Extended Data Fig. 7a). The final map displayed in Fig. 2b, c has been sharpened
and filtered to 16 Å.
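
A resolution estimate by the 0.143 FSC criterion is read off where the Fourier shell correlation curve first drops below 0.143. The sketch below illustrates that read-out only, assuming linear interpolation between neighbouring shells; the function name and the FSC values are made up for illustration and are not the data in Extended Data Fig. 7a.

```python
def resolution_at_threshold(spatial_freqs, fsc_values, threshold=0.143):
    """Resolution (1 / spatial frequency) at the first downward crossing.

    Returns None if the curve never falls below the threshold.
    """
    for i in range(1, len(fsc_values)):
        above, below = fsc_values[i - 1], fsc_values[i]
        if above >= threshold > below:
            # Linear interpolation between the two shells around the crossing.
            frac = (above - threshold) / (above - below)
            f_cross = spatial_freqs[i - 1] + frac * (
                spatial_freqs[i] - spatial_freqs[i - 1]
            )
            return 1.0 / f_cross
    return None


# Hypothetical FSC curve; frequencies in 1/Å.
freqs = [0.01, 0.02, 0.04, 0.06, 0.08, 0.10]
fsc = [0.99, 0.95, 0.70, 0.243, 0.043, 0.01]

resolution_A = resolution_at_threshold(freqs, fsc)
```

For this made-up curve the crossing lies midway between 0.06 and 0.08 Å⁻¹, which corresponds to a resolution of about 14.3 Å.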

Cryo-EM map tilt-pair validation. To further validate the cryo-EM map (Fig. 2b),
we carried out tilt-pair analysis” of the AcrABZ-TolC complex. Pairs of images 
were recorded on a JEM2010F electron microscope at a magnification of 30,000 
with a relative tilt of 10 degrees, along a known axis. Each pair of particle images 
thus has a known experimental relative tilt. We then determined the orientation 
of each particle in the pair using the final reconstructed map as a reference. The 
relative tilt between these computed orientations should match the experimental 
tilt. Each point in Extended Data Fig. 7b represents one pair of particles in the tilt 
pair, with radius representing computed relative tilt, and azimuth representing tilt 
angle. Ideally all points would lie in exactly the same location; however, there is 
always some uncertainty in orientation determination, exacerbated in this case by 
radiation damage in the second image of the tilt pair and the strong pseudo hex- 
agonal symmetry. As previously observed”, if the map is incorrect, the relative 
angles will not correlate at all, and produce a nearly random distribution over the 
sphere. A clear cluster as observed in our case is an indication of successful validation, 
and our computed tilt angle of 10.44 degrees is a good match with the microscope 
setting of 10 degrees. The presence of some points at mirrored positions across the 
origin is an indication of weak handedness in the map, meaning that the map and 
its mirror image are difficult to distinguish at this resolution in some orientations. 
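
The comparison at the heart of tilt-pair validation is the relative rotation between the two computed orientations of a particle. The following is a minimal sketch of that step, assuming orientations are available as 3 × 3 rotation matrices; the helper names and the 37-degree in-plane angle are hypothetical, and real software also reports the tilt-axis direction.

```python
import math


def matmul(a, b):
    """Product of two 3x3 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(m):
    return [list(row) for row in zip(*m)]


def rot_x(deg):
    c, s = math.cos(math.radians(deg)), math.sin(math.radians(deg))
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]


def rot_z(deg):
    c, s = math.cos(math.radians(deg)), math.sin(math.radians(deg))
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def relative_tilt_deg(r_untilted, r_tilted):
    """Rotation angle of r_tilted * r_untilted^T, from the matrix trace."""
    rel = matmul(r_tilted, transpose(r_untilted))
    cos_angle = (rel[0][0] + rel[1][1] + rel[2][2] - 1.0) / 2.0
    return math.degrees(math.acos(max(-1.0, min(1.0, cos_angle))))


# Simulated pair: an arbitrary orientation, then a 10-degree stage tilt about x.
r_first = rot_z(37.0)
r_second = matmul(rot_x(10.0), r_first)
tilt = relative_tilt_deg(r_first, r_second)
```

For a correct map the angles computed this way cluster near the experimental tilt, here 10 degrees.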

Model docking. The merged map was low-pass filtered before rigid body docking
of crystal structures using Coot and UCSF Chimera” (http://www.cgl.ucsf.edu/ 
chimera). The symmetric partially opened model of TolC was used to prepare a 
fully opened model (PDB accession 2XMN)”. A homologous model of the inner 
sets of coiled coils was devised based on the outer sets of coiled coils, which was 
used to replace the inner sets of coiled coils. The uncoiling movement of the inner 
sets of coiled coils produced a fully opened TolC’. 

The β-barrel domain, lipoyl domain and α-helical hairpin domain of each AcrA
protomer were modelled with chain C in the asymmetric unit of the crystal structure 
of AcrA (PDB accession 2F1M)*. A homology model of the membrane proximal 
domain of AcrA was devised based on the structure of MexA (PDB accession 2V4D)”. 

The individual domains of AcrA were fitted into the cryo-EM reconstruction 
using Coot, according to the features of the cryo-EM map and prior knowledge of 
the homohexameric MacA assembly (PDB accessions 3FPP and 4DK0). The closed-
state (PDB accession 1EK9), partially opened (PDB accession 2XMN) and fully
opened models of TolC were each fitted separately into the cryo-EM reconstruction.
The AcrBZ trimer, the TolC trimer and the AcrA hexamer were treated
as individual rigid bodies. These three rigid bodies were automatically fitted into the 
cryo-EM map by sequential fitting using Chimera. The AcrABZ-TolC model gen- 
erated by Chimera was slightly adjusted manually to optimize the local fit into the 
density using Coot”. The cross-correlation coefficient for fitting the AcrABZ-open- 
TolC model to the map is 0.924 and was calculated using the ‘fit in map’ function of 
Chimera. 
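
A cross-correlation coefficient of this kind compares the experimental density with a density simulated from the fitted model at the same grid points. The sketch below shows a generic normalized correlation, with an optional mean-subtracted variant; it is an illustration under those assumptions and not a reproduction of Chimera's implementation, and the density values are invented.

```python
import math


def correlation(map_values, model_values, about_mean=False):
    """Normalized correlation between two equally sampled density lists."""
    u, v = list(map_values), list(model_values)
    if about_mean:
        mean_u, mean_v = sum(u) / len(u), sum(v) / len(v)
        u = [x - mean_u for x in u]
        v = [x - mean_v for x in v]
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


# Invented densities: the simulated map has the same shape at twice the scale.
experimental = [0.0, 0.2, 0.9, 1.0, 0.4, 0.1]
simulated = [0.0, 0.4, 1.8, 2.0, 0.8, 0.2]

cc = correlation(experimental, simulated)
cc_about_mean = correlation(experimental, simulated, about_mean=True)
```

Because the coefficient is normalized, it is insensitive to the overall scale of either map, so proportional densities score 1.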

Extended Data Figure 1 | Co-purification of the AcrBZ and AcrBZ-
DARPin complexes. a, Gel filtration profile of AcrBZ complex and SDS-
polyacrylamide gel electrophoresis (SDS-PAGE) analysis of the peak fractions.
b, Gel filtration profile of the AcrBZ-DARPin complex and SDS-PAGE
analysis. The proteins were enriched by immobilized metal affinity
chromatography (IMAC; results not shown) before the size-exclusion
chromatography step. AcrZ has a heptahistidine tag on the C terminus
as an IMAC affinity tag, and the C-terminal histidines of AcrB have been
removed to prevent its binding to the matrix. Standard proteins
thyroglobulin (669 kDa) and aldolase (158 kDa) (Bio-Rad) were eluted from the
same column at volumes of 11.4 and 14.7 ml, respectively. See also Fig. 1.


©2014 Macmillan Publishers Limited. All rights reserved 


Extended Data Figure 2 | Validation of the AcrBZ crystal structure. An 
AcrB protomer taken from the refined structure of the AcrBZ-DARPin
complex at a resolution of 3.3 Å was used in molecular replacement of the 3.7 Å
data of the AcrBZ complex (Extended Data Table 1). The model of AcrB 
without AcrZ was refined with REFMAC* jelly-body and reference structure 
restraints, and a difference map calculated in which AcrZ does not 
contribute to the phases. Shown are the positive features of the difference map
in the vicinity of AcrZ (green carbon atoms) taken by superposing the AcrB
protomers from the AcrBZ-DARPin and AcrBZ (red) structures. The
unbiased map shows the presence of AcrZ and indicates that the presence of
DARPin does not disrupt the interactions with AcrB. The model of the
AcrBZ-DARPin complex at a resolution of 3.3 Å was refined without AcrZ to
generate a simulated annealing omit map, which confirmed the location of 
AcrZ (data not shown). See also Fig. 1. 

Extended Data Figure 3 | The AcrBZ complex resembles AcrB-YajC, and
AcrBZ interactions involve conserved residues. a, Superposition of AcrB-
YajC (orange and yellow; PDB accession 2RDD) onto AcrBZ (red and green).
See also Fig. 1. We were not able to identify an interaction between AcrB
and C-terminal histidine-tagged YajC from E. coli, suggesting that the
interaction is less avid than the AcrB-AcrZ pairing (data not shown). It is
interesting to note that the contacting surface is also conserved in another RND
protein, SecDF, which is involved in protein translocation and membrane
insertion”. Therefore, it seems likely that SecDF might be modulated by a
similar helical peptide. Indeed, YajC forms a complex with SecDF”” and could
act as an allosteric modulator in protein biogenesis. b, Sequence conservation
of AcrZ. The secondary structure is annotated at the top and asterisks
indicate residues that directly contact AcrB. c, Sequence variation of the surface
of AcrB homologues, showing conservation of the surface that
contacts AcrZ. Right-hand side includes the bound AcrZ in stick representation
(green). The colour spectrum ranges from purple (most conserved) to blue
(least conserved). This figure was made using ConSurf*!.

Extended Data Figure 4 | Schematic representations and purification of the
fusion proteins used to assemble the efflux pump. a, The AcrA-AcrB
fusion, with two flexible poly(Gly-Ser) linkers. The two C-terminal histidines of
AcrB have been removed to prevent binding to the NTA matrix during co-
purification. b, The AcrA-AcrZ-His5 fusion and TolC co-expression
construct. The numbers above the bars correspond to the residues of the
protein, and, owing to the restriction site used for cloning, a single glycine
residue was inserted after the start codon in both AcrA and AcrZ.
The flexible poly(Gly-Ser) linker permits the protomers to manoeuvre.
c, Co-purification of TolC with the AcrABZ complexes. SDS-PAGE of the
eluate from gel filtration following nickel affinity chromatography purification.
See also Fig. 2.

Extended Data Figure 5 | The fusion assemblies can drive efflux in vivo.
a, b, Drug-transport assays of the AcrAB and AcrAZ fusion proteins in the
MCΔtolCΔacrAB strain show that the engineered pump retains partial
activity for efflux of ethidium bromide (a) and trimethylammonium-
diphenylhexatriene (b). The results show representative traces from six
biological replicates. For ethidium bromide, initial influx rates of 10.3 ± 0.6,
6.4 ± 0.6 and 3.6 ± 0.4 arbitrary units (AU) per min were obtained for the
non-expressing control cells, the cells expressing both fusion proteins plus TolC
and the wild-type cells expressing AcrAB-TolC, respectively. For
trimethylammonium-diphenylhexatriene, initial influx rates of 187 ± 6,
151 ± 3 and 54 ± 2 AU min⁻¹ were obtained for the non-expressing control
cells, the cells expressing both fusion proteins plus TolC and the wild-type cells
expressing AcrAB-TolC, respectively. c, MIC data on antimicrobial
susceptibility of E. coli MCΔtolCΔacrAB cells expressing wild-type AcrA-
AcrB-TolC or the fusion proteins with TolC. These data indicate that the fusion
of AcrA-AcrZ and the insertion of AcrA into AcrB both diminish the capacity
for drug resistance. The plasmids pET21a, pET21a-AcrB or pET21a-acrB329-
polyGS-acrA-polyGlySer-acrB329 (AcrAB), together with plasmid pRSFDuet-1-
acrA-polyGlySer-acrZps5-tolC (AcrAZ + TolC), were transformed into
MCΔtolCΔacrAB. Cells were tested for their ability to resist increasing
concentrations of cytotoxic drugs. As a positive control, cells were transformed
with plasmid pBAD_AcrA+AcrB+TolC, which encodes the native AcrA-
AcrB-TolC efflux pump.

Extended Data Figure 6 | Electron microscopy images and class averages of
the AcrABZ-TolC complex. a, Negative-stain electron microscopy images of
the purified AcrABZ-TolC complex after GraFix treatment. White circles
indicate particles with long axis almost normal to the viewing plane; black
circles show particles with the long axis parallel to the viewing plane.
b, c, Class averages from cryo-EM data and reconstructions with C1 symmetry
and with C3 symmetry, respectively. The galleries in the side panels show
representative two-dimensional class averages of the purified pump. The top
shows views perpendicular to the long axis of the drumstick shape,
the middle shows inclined views, and the bottom shows views along the long
axis. The reconstructed images and cross-sections indicate the
presence of a six-fold symmetry, which is consistent with six AcrA
protomers. The pseudo-atomic model has three protomers each
of AcrB, AcrZ and TolC, and as TolC and AcrB each have an internal
structural repeat, they have pseudo six-fold symmetry at low resolution (cut
view). The maps are consistent with each other. See also Fig. 2.

Extended Data Figure 7 | Resolution estimation and validation of the cryo-
EM map. a, Fourier shell correlation (FSC) of two independently determined
cryo-EM maps of the efflux pump assembly after their alignment by
Foldhunter”. b, Tilt-pair validation of the efflux pump assembly. Each point
represents a pair of particles with experimentally known relative tilt.
The radius indicates the computationally determined amount of tilt, and the
azimuth indicates tilt direction. The red circle denotes particle pairs that
cluster around the experimental tilt axis geometry, thus validating the map in
Fig. 2.

Extended Data Figure 8 | Sections through the electron microscopy map.
a, Slices 1-4: view of planes normal to the three-fold symmetry axis of the
pump. b, Slice 5 of the model assembly showing the opening of the TolC
periplasmic domain. The same transverse plane through the cryo-EM map is
shown with TolC in the closed state, as seen in the crystal structure of the
isolated protein (left; PDB accession 1EK9), the partially open structure,
made by mutations (middle; PDB accession 2XMN) and the modelled fully
opened structure (right).

Extended Data Table 1 | Crystallographic data and refinement statistics

Crystal                                     AcrB/AcrZ/DARPin           AcrB/AcrZ
Space group                                 R32 (H32)                  R32 (H32)
Cell dimensions in hexagonal setting
  a, b, c (Å)                               145.01, 145.01, 538.34     146.19, 146.19, 543.00
  α, β, γ (°)                               90, 90, 120                90, 90, 120
Resolution (Å)                              30.0-3.30 (3.46-3.30)*     24.89-3.69 (3.99-3.69)
Biss                                        13.4 (98.6)                12.6 (117.0)
Total number of observations                166,656 (22,416)           244,483 (50,138)
Unique reflections                          33,271 (4,368)             23,077 (4,710)
Completeness, %                             99.8 (100.0)               94.5 (94.8)
I/σI (outer shell)                          11.0 (1.6)                 15.1 (2.1)
<I> half-set correlation                    0.998 (0.585)              0.999 (0.791)
B values Wilson plot/average in
  structure, Å²                             76.6/87.7                  148/117.7
Reflections used in refinement              31,614                     21,912
Geometry: Ramachandran favoured,
  allowed, outliers, %                      95.2: 4.6: 0.2             94.9: 4.9: 0.3
Rms deviation from ideal values for
  bond lengths (Å) and bond angles (°)      0.012, 1.456               0.011, 1.57
Number of non-hydrogen atoms
  in refinement                             9,452                      7,997
Refinement (R/Rfree, 5% reflections
  in test set), %                           28.3/32.4                  33.9/36.1

*Values in parentheses are for highest-resolution shell.
The PDB deposition codes for model and structure factors of AcrB/AcrZ/DARPin and AcrB/AcrZ
are 4C48 and 4CDI respectively.

Structural basis of Sec-independent membrane protein insertion by YidC

Kaoru Kumazaki, Shinobu Chiba, Mizuki Takemoto, Arata Furukawa, Ken-ichi Nishiyama, Yasunori Sugano,
Takaharu Mori, Naoshi Dohmae, Kunio Hirata, Yoshiko Nakada-Nakura, Andrés D. Maturana, Yoshiki Tanaka,
Hiroyuki Mori, Yuji Sugita, Fumio Arisaka, Koreaki Ito, Ryuichiro Ishitani, Tomoya Tsukazaki & Osamu Nureki


Newly synthesized membrane proteins must be accurately inserted 
into the membrane, folded and assembled for proper functioning. 
The protein YidC inserts its substrates into the membrane, thereby 
facilitating membrane protein assembly in bacteria; the homologous 
proteins Oxa1 and Alb3 have the same function in mitochondria and
chloroplasts, respectively'”. In the bacterial cytoplasmic membrane,
YidC functions as an independent insertase and a membrane chap-
erone in cooperation with the translocon SecYEG* *. Here we pres-
ent the crystal structure of YidC from Bacillus halodurans, at 2.4 Å
resolution. The structure reveals a novel fold, in which five conserved 
transmembrane helices form a positively charged hydrophilic groove 
that is open towards both the lipid bilayer and the cytoplasm but 
closed on the extracellular side. Structure-based in vivo analyses reveal 
that a conserved arginine residue in the groove is important for the 
insertion of membrane proteins by YidC. We propose an insertion 
mechanism for single-spanning membrane proteins, in which the 
hydrophilic environment generated by the groove recruits the extra- 
cellular regions of substrates into the low-dielectric environment of 
the membrane. 

The Sec translocon, a protein-conducting channel conserved in all 
three phylogenetic domains, translocates secretory proteins across the 
membrane and inserts membrane proteins into the membrane by an 
hourglass-shaped pore formed by ten transmembrane helices**. In 
bacteria, another membrane protein, YidC, is involved in the folding 
and insertion of many membrane proteins”""’, such as subunit c of the 
FoF1-ATPase (Foc), and thus is essential for cell viability’°. YidC is con-
sidered to function as both a Sec-dependent membrane chaperone and
a Sec-independent insertase. In the Sec-independent pathway, YidC di- 
rectly interacts with the translating ribosome to mediate the insertion 
of several single-spanning or double-spanning membrane proteins'*””. 
Previous studies have suggested that YidC forms a face-to-face dimer, cre- 
ating a channel at its dimer interface'®’*. By contrast, another research 
group has reported that a monomer of membrane-embedded YidC is 
sufficient for binding to the substrate-translating ribosome”’. Oxa1, the
mitochondrial homologue of YidC, reportedly functions as a voltage- 
gated membrane channel, as well as a membrane protein insertase, prob- 
ably by forming a tetramer”. However, the lack of a high-resolution 
structure of YidC has limited our understanding of the molecular mech- 
anism of YidC-mediated membrane protein insertion. 

Members of the genus Bacillus have two yidC genes, encoding YidC1 
and YidC2. We determined the crystal structures of two constructs of 
B. halodurans YidC2 (YidC27–266 and YidC27–267), which lack both the
amino-terminal signal and the carboxy-terminal non-conserved sequences 


(Fig. 1, Extended Data Table 1 and Extended Data Figs 1 and 2). A similar 
deletion variant of Bacillus subtilis YidC1 (also known as SpoIIIJ), an
orthologue of B. halodurans YidC2 with 49.6% sequence identity, has
comparable activity to the full-length protein in vivo (K248stop in
Fig. 2a). Although the crystallographic asymmetric units of YidC27–266
and YidC27–267 contain one and two YidC molecules, respectively, the
molecules in the crystalline lattice do not appear to form an effective olig- 
omer, such as a face-to-face dimer’®"* (Extended Data Fig. 3a, b). Recent 
fluorescence correlation spectroscopy and cryo-electron microscopy 
analyses of YidC bound to a translating ribosome showed that YidC 
exists as a monomer in detergent solution and in lipid membranes'””. 
Consistent with these findings, our analysis using size exclusion chro-
matography coupled to multi-angle laser light scattering (SEC-MALLS)
also showed that YidC exists as a monomer in detergent solution (Ex-
tended Data Fig. 3c–e). Because the overall structures of YidC27–266 and
YidC27–267 are nearly identical (with a root mean square deviation of 1.79
Å over residues 27-266), we mainly describe the structure of YidC27–266.
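
The root mean square deviation quoted above summarizes the distance between equivalent atoms of two superposed structures. The sketch below shows only the final distance calculation, assuming the coordinates have already been optimally superposed and paired; the function name and the toy coordinates are hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """RMSD between paired (x, y, z) coordinates that are already superposed."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    total = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(total / len(coords_a))


# Toy example: three atoms, each displaced by 1 Å along z.
structure_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
structure_b = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0), (2.0, 0.0, 1.0)]

deviation = rmsd(structure_a, structure_b)
```

In practice the superposition itself (for example a least-squares fit) is done first, and the reported value depends on which atoms are paired.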

The YidC27–266 structure consists of the N-terminal E1 region, the
C-terminal C3 tail and the core region: the core region is composed of 
transmembrane helices 1-5 (TM1-5), connected by two cytoplasmic (C1 
and C2) regions and two extracellular (E2 and E3) regions (Fig. 1c). The 
E1 region consists of the EH1 helix, whereas the C1 region forms a 
hairpin-like structure composed of two helices (CH1 and CH2) con-
nected by a short loop. The residues in the C2 region and the C3 tail are
structurally disordered. The EH1, CH1 and CH2 helices protrude from 
the core region and lie nearly parallel to the plane of the membrane. The 
EH1 helix is amphipathic, whereas the CH1 and CH2 helices contain 
mainly hydrophilic residues. This observation suggested that one face of 
the EH1 helix is embedded in the membrane, whereas the CH1 and CH2 
helices are exposed to the solvent (Fig. 1a). This idea is also consistent with 
theoretical calculations using an implicit membrane model (see Methods). 

A comparison of the structures of YidC27–266 and YidC27–267 sug-
gested that the C1 region, which does not interact with the remainder
of YidC, is flexible (Extended Data Fig. 4a-c). The CH1 helix forms a 
continuous helix with the TM1 helix, which is kinked at the conserved 
proline residues P78 and P94 (Fig. 1c). The proline residues and the par- 
tially disordered flexible loop connecting the CH2 and TM2 helices 
(residues 130-140), which is likely to be embedded in the membrane, 
could enhance the flexibility of the C1 region. Indeed, the C1 region has 
higher B-factors than the other regions (Extended Data Fig. 5a). A mole- 
cular dynamics simulation also suggested that the position of the C1 
region fluctuates greatly in the lipid bilayer environment (Extended Data 
Fig. 6a, b). 

Figure 1 | Overall structure of YidC. a, Cartoon representations of Bacillus
halodurans YidC viewed transversely through the membrane (left) and from 
the extracellular fluid (right). The structure is coloured from blue to red from 
the N to the C terminus. b, Surface model representations of YidC coloured 
according to electrostatic potential, ranging from blue (+20 kT/e) to red 


To further investigate the functional importance of the C1 region, 
we performed a structure-based genetic analysis of B. subtilis SpoIIIJ.
The membrane insertion of MifM, a single-spanning membrane protein,
is mediated by the SpoIIIJ pathway”’. In this analysis, we measured the
β-galactosidase (LacZ) activity of a YidC2-LacZ translational fusion
protein: this activity increased in response to a defect in the SpoIIIJ-
dependent insertion of MifM (Extended Data Figs 7a and 8). Two C1-
deletion mutants (Extended Data Fig. 7b) exhibited higher LacZ activities
than wild-type SpoIIIJ (Fig. 2a), suggesting that the C1 region is crucial
for the YidC-mediated membrane insertion of MifM. We then performed
a growth complementation analysis using B. subtilis (as described in
Extended Data Fig. 7c), which also suggested that the C1 region is impor-
tant for SpoIIIJ activity (Fig. 2b).

The extracellular halves of the TM1-5 helices, in addition to the E2 
region, are tightly packed with hydrophobic side chains (Extended Data 
Fig. 5b, c), whereas the cytoplasmic halves of the TM1-5 helices loosely 
interact with each other to form a groove (~2,000 Å³). This groove con-
tains many hydrophilic residues, including the conserved T68, R72, Q82,
Q142, Q187, N248 and Q254 residues, and thus generates a hydrophilic 
environment in the lipid bilayer (Fig. 2c and Extended Data Fig. 1). The 
conserved R72 residue, the only charged residue in this groove, pro- 
trudes into the centre of the groove and creates a strong positive elec- 
trostatic potential in the groove (Fig. 1b, d). This hydrophilic groove is 
open to both the cytoplasmic side and the membrane interior. By con- 
trast, the groove is sealed towards the extracellular side by the hydro- 
phobic core and is not accessible from that side (Fig. 1d). The hydrophobic 
core, which consists of the hydrophobic residues in the extracellular half 
of YidC, has lower B-factors than the other regions, suggesting that it 
is rigid (Extended Data Fig. 5). Structure-based genetic analyses sug-
gested the importance of this hydrophobic core for the function of YidC
(Fig. 2a–c and Extended Data Fig. 7b). A comparison of the YidC27–266 and
YidC27–267 structures revealed that the groove in YidC27–267 is narrower
(−20 kT/e), where kT is thermal energy and e is the elementary charge.

c, Topology diagram of YidC, coloured as in a. The grey area represents the 
cytoplasmic membrane. d, Cut-away molecular surface representations, viewed 
transversely through the membrane, coloured as in b. 


than that in YidC27–266, suggesting that the groove has structural flex-
ibility (Extended Data Fig. 4d). Molecular dynamics simulations also
suggested that the size of the hydrophilic groove slightly fluctuated during 
a 1,000-ns simulation (Extended Data Fig. 6a). By contrast, the overall 
architecture of the core region, and the structure of the hydrophilic 
groove, remained stable in the lipid bilayer (Extended Data Fig. 6b), and 
the groove was constantly filled with ~20 water molecules (Extended 
Data Fig. 6c). The extracellular side of the groove remained sealed by 
the hydrophobic protein residues and the aliphatic lipid chains and thus 
was impermeable to ions and water molecules during molecular dynamics 
simulations. Taking these findings together, YidC appears to provide a 
flexible hydrophilic groove in the membrane that is open towards both 
the cytoplasmic side and the membrane interior but tightly sealed on 
the extracellular side. 

To investigate the functional importance of the hydrophilic groove, 
we examined the membrane insertion activities of SpoIIIJ molecules in 
which R73 (R72 in YidC; Fig. 2c) or the conserved glutamine residues 
had been mutated, by using in vivo genetic analyses. All tested mutations 
of R73 abolished MifM insertion activity, except for replacement of 
arginine with lysine (R73K), which slightly decreased the insertion activity 
(Fig. 2d). None of the R73 mutants, except for R73K, complemented the 
growth of wild-type SpoIIIJ- and wild-type YidC2-depleted cells (Fig. 2b).
By contrast, mutation of Q83, Q140, Q187 or Q238 (Q82, Q142, Q187 
and Q254, respectively, in YidC; Fig. 2c) to alanine did not affect MifM 
insertion activity (Fig. 2e). A similar result was obtained for a chimaera of 
the Pf3 coat protein, another single-spanning membrane protein inserted 
by the Sec-independent pathway, and the cytoplasmic region of MifM 
(Fig. 2f and Extended Data Fig. 9a). Taken together, these findings high-
light the importance of the positive charge in the groove for insertion
of MifM and the Pf3 coat protein into the membrane by YidC, whereas 
the conserved polar residues are probably important for creating the 
hydrophilic environment in the groove. 

Figure 2 | In vivo functional analyses of YidC. a, d-f, Efficiencies of MifM
and Pf3-MifM fusion protein insertion into the membrane, as determined by
LacZ activity (mean ± s.d., n = 3) in spoIIIJ mutant cells, as indicated (top).
The accumulation of these proteins in the cell is shown, detected by anti-FLAG
or anti-SpoIIIJ immunoblotting (bottom). K248stop represents a spoIIIJ
derivative that has a stop codon at position 248. Δ92-126-GG represents a
mutant in which the entire C1 region has been replaced by a glycine-glycine


Several single-spanning membrane proteins, including MifM and the 
Pf3 coat protein, have acidic residues in their N-terminal extracellular 
tails. Thus, our results suggested that these acidic residues might interact 
with the arginine residue in the hydrophilic groove of YidC. To address 
this possibility, we examined the importance of these acidic residues by 
using an in vivo genetic analysis (Extended Data Fig. 9a). Mutating the 
acidic residues in the Pf3 coat protein had a less pronounced effect than
mutating these residues in MifM; however, these mutations negatively 
affected the membrane insertion efficiencies of both MifM and the Pf3 
coat protein, thereby supporting our hypothesis (Extended Data Fig. 9b). 
To further confirm that direct interactions occur between the substrate and 
the hydrophilic groove, we performed an in vivo site-directed ultraviolet-
radiation crosslinking analysis, using p-benzoyl-L-phenylalanine (pBpa)”.
pBpa was introduced at Q187 or W244 in the groove and at A74 or 
I249 on the exterior surface of YidC (Fig. 2c and Extended Data Fig. 1).
Escherichia coli cells coexpressing MifM and the pBpa variant of YidC 
were exposed to ultraviolet radiation, and products that were crosslinked 
with MifM were analysed by immunoblotting. We detected crosslinked 
products only at positions 187 and 244, suggesting that the groove can 
interact with MifM (Fig. 3). Taken together, these observations suggested 
that the site around R72 in the hydrophilic groove is the substrate-binding 
site, through recognition of the hydrophilic residues of the substrate such 
as the acidic residues in MifM and the Pf3 coat protein. 

On the basis of these structural and functional analyses, we propose 
a mechanism for the insertion of single-spanning membrane proteins 
with an acidic N-terminal extracellular region, such as MifM and the Pf3 
coat protein, that is mediated by monomeric YidC (Fig. 4). In this mech- 
anism, the substrate protein initially interacts with the C1 region (Fig. 4a) 
and is then transiently captured in the hydrophilic groove of YidC. This 
substrate binding can induce structural changes in the hydrophilic groove, 
linker. Δ97–103/Δ114–120 represents a mutant in which both the CH1
and CH2 helices have been shortened by seven residues. b, Growth
complementation of Bacillus subtilis cells reliant on a chromosomal SpoIIIJ
mutant in the absence of isopropyl-β-D-thiogalactoside (IPTG). c, Close-up
view of the hydrophilic groove of Bacillus halodurans YidC showing the side
chains of the indicated residues. The corresponding residues in B. subtilis
SpoIIIJ are indicated in parentheses. WT, wild type.


to accommodate the various substrate proteins. In this context, the con- 
served arginine residue in the groove might participate in substrate recog- 
nition (Fig. 4b). Subsequently, the transmembrane region of the substrate 
Figure 3 | Substrate binding to the hydrophilic groove. In vivo
photo-crosslinking between the hydrophilic groove of YidC and MifM. The
amino acid positions at which pBpa was introduced into YidC are indicated.
Membrane proteins were purified by separation on a nickel affinity column and
analysed by SDS-PAGE, and YidC-FLAG-His8 and MifM were detected by
anti-FLAG and anti-MifM immunoblotting, respectively. The ~40-kDa bands
are attributed to products in which YidC is crosslinked to MifM and to
unidentified endogenous proteins, as indicated by YidC-X. The accumulation
of MifM in the membrane is shown (bottom).
Figure 4 | Proposed model for membrane insertion of a single-spanning
membrane protein. The obtained crystal structure probably represents the
resting state before substrate binding. The hydrophilic region of the substrate
would then be transiently captured in the hydrophilic groove of YidC
(a, b), resulting in the substrate-bound state (c). Substrate release into the


protein is released into the membrane, with the hydrophilic residues 
translocated to the extracellular side. The substrate release can be facil- 
itated by the hydrophobic interaction between the transmembrane region 
and the lipid aliphatic chains. The membrane potential could also facil- 
itate this process, by attracting the negative charge of the extracellular 
tail (Fig. 4c).

The above mechanism clearly explains the insertion of a certain class 
of membrane proteins: that is, single-spanning membrane proteins with 
an acidic N-terminal extracellular region. However, it cannot account 
for the YidC-mediated insertion of other classes of membrane protein. 
For example, for E. coli YidC, deletion of the C1 region or simultaneous 
substitution of five amino acids, including the conserved arginine residue,
with serine, was not found to impair the insertion of an M13 procoat
derivative, a double-spanning membrane protein. Therefore, it is
likely that these membrane proteins are inserted by a different mech- 
anism, which might involve either sites in the hydrophilic groove other 
than the arginine residue or transient oligomer formation by YidC. Fur- 
ther structural and biological studies are required to clarify the mecha- 
nism of the YidC-mediated insertion of these membrane protein classes. 


METHODS SUMMARY 


Histidine-tagged Bacillus halodurans YidC2 (YidC) was overproduced in Escherichia
coli, solubilized from the membrane with n-dodecyl-β-D-maltoside and cholesteryl
hemisuccinate and purified by successive nickel (Ni-NTA) chromatography and gel
filtration chromatography steps. The histidine tag was then cleaved with tobacco etch
virus (TEV) protease. Crystals were grown in a lipidic cubic phase, using monoolein.
Diffraction data were collected on beamline BL32XU at SPring-8. The structure of
YidC was determined by the multi-wavelength anomalous diffraction method, using
a methyl-mercury-chloride-derivatized YidC (Y150C mutant) crystal and was refined
to an Rwork/Rfree of 24.2%/25.9% at 2.4 Å resolution. Molecular dynamics
simulations of YidC in the explicit phosphoryl oleoyl phosphatidylethanolamine (POPE)
lipid bilayer were performed with the NAMD program (version 2.8) for 1,000 ns.
The MifM insertion activity of YidC was analysed by using a MifM-based assay
in which the β-galactosidase (LacZ) activity of a YidC2-LacZ translational fusion
increases in response to a defect in the SpoIIIJ-dependent insertion of MifM. The
growth complementation assay was performed using mutant Bacillus subtilis cells,
in which the yidC2 gene on the chromosome was disrupted and the spoIIIJ gene
was mutated, with rescue by a plasmid harbouring the wild-type
isopropyl-β-D-thiogalactoside (IPTG)-inducible spoIIIJ gene. In vivo photo-crosslinking
was performed using pBpa. B. subtilis MifM and pBpa variants of YidC were overexpressed
in E. coli, and the cells were exposed to ultraviolet radiation. YidC was then purified
by Ni-NTA chromatography, and products that were crosslinked with MifM were
detected by immunoblotting. See Methods for further detail.


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS

Cloning, expression and purification of YidC for structure determination. Bacillus
halodurans YidC2 (YidC) was cloned into a pET-modified vector. The two resultant
plasmids, encoding YidC1–26-H8-ENLYFQGQ-YidC27–266 (YidC27–266) and
YidC1–26-ENLYFQGQ-YidC27–267-LESSV-ENLYFQGQ-green fluorescent protein
(GFP)-H8 (YidC27–267), were expressed in Escherichia coli C41(DE3) cells harbouring
pRARE (Novagen), and the proteins were purified using the following protocol.
The cells were grown in a 5 l culture at 37 °C to an absorbance at 600 nm of 0.7, and
expression was induced with 1 mM isopropyl-β-D-thiogalactoside (IPTG) at 15 °C
for 16 h. The cells were then harvested by centrifugation (4,500 g, 10 min, 4 °C),
resuspended in buffer containing 20 mM Tris-HCl, pH 8.0, and 0.1 mM
phenylmethylsulphonyl fluoride, and disrupted by two passages through a Microfluidizer
processor (Microfluidics) at 15,000 p.s.i. After removal of the debris by centrifugation
(12,000 g, 30 min, 4 °C), the supernatant was ultracentrifuged (138,000 g, 1 h, 4 °C) to
pellet the membranes, which were then solubilized in a buffer containing 300 mM
NaCl, 20 mM Tris-HCl, pH 8.0, 20 mM imidazole, 1% n-dodecyl-β-D-maltoside
(DDM) and 0.1% cholesteryl hemisuccinate (CHS). Insoluble materials were removed
by ultracentrifugation (138,000 g, 30 min, 4 °C), and the supernatant was mixed with
5 ml Ni-NTA Superflow (Qiagen). After binding for 1 h, the resin was washed with
300 mM NaCl, 20 mM Tris-HCl, pH 8.0, 20 mM imidazole, 0.1% DDM and 0.01%
CHS, and YidC was eluted with the same buffer supplemented with imidazole to a
final concentration of 300 mM at 4 °C. The N-terminal residues and the His8 tag or
GFP-His8 tag were cleaved by a His-tagged tobacco etch virus (TEV) protease, and
the sample was reloaded onto the Ni-NTA column (5 ml) to remove the protease.
The flow-through fraction containing YidC was collected, concentrated and further
purified on a Superdex 200 10/300 GL column (GE Healthcare) in 300 mM NaCl,
20 mM Tris-HCl, pH 8.0, 0.1% DDM and 0.01% CHS at 4 °C. For crystallization,
the purified protein was concentrated to 6 mg ml⁻¹ with a centrifugal filter device
(Millipore, 50 kDa molecular weight cutoff) and dialysed against the following buffer
(1 mM Tris-HCl, pH 8.0, 0.05% DDM and 0.005% CHS). For mercury derivatization,
the Y150C YidC27–266 mutant was purified and incubated with 2 mM methyl
mercury chloride for 1 h before crystallization.

Crystallization and heavy-atom derivatization. The protein was mixed with monoolein
in a 2:3 protein to lipid ratio (w/w), using the twin-syringe mixing method.
Aliquots (50 nl) of the protein lipidic cubic phase (LCP) mixture were spotted on
a 96-well sandwich plate and overlaid with 800 nl precipitant solution using a
mosquito LCP crystallization robot (TTP LabTech). The crystals of YidC27–266 and
YidC27–267 were grown at 20 °C in reservoir solutions containing 28–32%
poly(ethyleneglycol) 500 dimethylether (PEG500DME), 2.5 mM CdCl2 and 100 mM
Na(CH3)2AsO2, pH 6.0; and 24–26% PEG500DME, 10 mM CuCl2, 200 mM NH4COOH
and 100 mM MES-NaOH, pH 6.0, respectively. The heavy-atom derivatized crystals
were obtained by co-crystallization of the Y150C mutant and methyl mercury
chloride in the same reservoir solution used for the YidC27–266 crystals. The crystals
grew to full size in 2–3 weeks. The crystals were flash-cooled, using reservoir
solution supplemented with 20% PEG500DME and 20% glycerol as a cryoprotectant
and then stored in liquid nitrogen.

Data collection and structure determination. X-ray diffraction data sets were
collected by the helical data collection method on beamline BL32XU at SPring-8,
using a micro beam with a 1-µm width and a 10-µm height. Diffraction data were
processed using the program HKL-2000 (HKL Research) or the program XDS.
One Hg atom site was identified with the program SHELXD. The initial phases
were calculated using the program SHARP, followed by solvent flattening with
SOLOMON. The main chain was traced by automated model building using the
program RESOLVE. The model was further built manually using COOT and
refined using PHENIX. The structures of YidC27–266 and YidC27–267 were
determined by molecular replacement, using the program PHASER. The Ramachandran
plots were calculated with the program RAMPAGE. Data collection and refinement
statistics are provided in Extended Data Table 1. The YidC27–266 crystal contains
one molecule in the asymmetric unit. The YidC27–267 crystal contains two
molecules in the asymmetric unit (Mol A and Mol B), but these molecules do not
form a face-to-face dimer. The figures depicting the molecular structures were
prepared using CueMol (http://www.cuemol.org/).

Bacterial strains and plasmids for in vivo functional analysis. The B. subtilis strains, 
plasmids and DNA oligonucleotides used are listed in Supplementary Tables 1-3 
and were constructed as described in the Supplementary Methods. 

β-Galactosidase activity assay and immunoblotting. B. subtilis cells were cultured
at 37 °C in LB or CH medium. Aliquots (500 µl) of cultures at an optical density at
600 nm of ~0.5 were harvested and used for β-galactosidase activity assays and
immunoblotting. The β-galactosidase activities were measured as described previously.
For immunoblotting, a 500 µl aliquot of the culture was mixed with 56 µl 50%
trichloroacetic acid and incubated on ice for at least 5 min. The cells were precipitated
by centrifugation (4 °C, 15,000 r.p.m., 5 min), washed with 1 ml 1 M Tris-HCl, pH 8.0,
resuspended in SB buffer (33 mM Tris-HCl, pH 8.0, 40% sucrose and 1 mM EDTA)
containing 1 mg ml⁻¹ lysozyme (Sigma) and incubated at 37 °C for 10 min. The
cells were then solubilized by adding an equal volume of 2× SDS loading buffer
(4% SDS, 0.1 M Tris-HCl, pH 6.8, 30% glycerol and 10 mM dithiothreitol (DTT))
and subjected to immunoblotting, using either anti-FLAG (Sigma) or anti-SpoIIIJ
antibodies, as described previously. Antiserum production is described in the
Supplementary Information.

Growth complementation assay. B. subtilis cells were cultured at 37 °C in LB medium
containing 100 µg ml⁻¹ spectinomycin and 1 mM IPTG. Aliquots (3.5 µl) of serially
diluted (10~ '-10~°), fully grown cultures were spotted on spectinomycin-containing 
LB agar plates with or without 1 mM IPTG and incubated at 37 °C for 15 h.

In vivo photo-crosslinking assay. B. halodurans yidC-FLAG (Sigma-Aldrich)-His8
and B. subtilis mifM were cloned into the NcoI and BamHI sites of MCS1 and the
NdeI and XhoI sites of MCS2 in pETDuet (Novagen), respectively. An amber mutation,
TAG, was introduced into yidC by site-directed mutagenesis. mifM-deletion
plasmids were generated by restriction digestion of the plasmids with SalI and XhoI
and then ligation. E. coli BL21(DE3) cells harbouring two plasmids, pEVOL-pBpF
(Addgene) and a pETDuet-based plasmid expressing YidC and MifM, were grown
at 37 °C in M9-glucose medium supplemented with 1 mM pBpa and appropriate
antibiotics until mid-logarithmic phase, and then induced with 1 mM IPTG for
30 min. Portions of the culture (1 ml for isolation of total membrane and 2 ml for
purification of YidC by Ni-NTA chromatography) were transferred to a dish and
exposed to ultraviolet radiation (365 nm) for 5 min, by using a B-100AP ultraviolet
lamp (UVP) at a distance of 5 cm. The irradiated cells were collected by centrifugation
at 7,000 g for 1 min at 4 °C, suspended in 300 µl 10 mM Tris-HCl buffer, pH 8.0,
containing 1 mM EDTA-Na and 0.1 mM 4-(2-aminoethyl)benzenesulphonyl fluoride
hydrochloride, and disrupted by freeze-thawing and sonication (Qsonica) with
cooling on ice. Cellular debris was separated by centrifugation at 9,000 g for 1 min
at 4 °C and used for the isolation of the total membrane fraction or for the
purification of YidC. The total membrane fraction was isolated by ultracentrifugation
at 100,000 g for 20 min at 4 °C. For the purification of YidC, the membrane proteins
were solubilized in SC buffer containing 20 mM Tris-HCl, pH 8.0, 300 mM NaCl,
20 mM imidazole-HCl, pH 8.0, 1% DDM, 0.1% CHS and 0.1 mM
4-(2-aminoethyl)benzenesulphonyl fluoride hydrochloride for 30 min at 4 °C. Insoluble
materials were removed by ultracentrifugation at 100,000 g for 20 min at 4 °C. The
supernatant was mixed with 0.1 ml Ni-NTA agarose (Qiagen) that had been
pre-equilibrated with SC buffer. After binding for 30 min at 4 °C, the resin was washed
with SC buffer containing 0.1% DDM, and then YidC was eluted with SC buffer
supplemented with 300 mM imidazole-HCl (pH 8.0). Proteins were separated by
SDS-PAGE and detected by immunoblotting using anti-FLAG (Sigma) or
anti-MifM antibodies.

Molecular dynamics simulations. The simulation system included YidC, phosphoryl
oleoyl phosphatidylethanolamine (POPE), water molecules and 150 mM NaCl.
At first, the position and orientation of YidC in the POPE lipid bilayer were optimized
using implicit solvent and membrane models (T.M. and Y. Sugita, manuscript in
preparation). Next, the disordered region in the C2 loop (residues 200–213) was
modelled, using the program MODELLER. The missing atoms, including hydrogens
in the protein, were built with the program VMD. Finally, the periodic boundary
system, including the explicit solvent and the POPE lipid bilayer, was prepared.
The resultant size of the simulation box was 96 Å × 96 Å × 96 Å. The net charge of
the solute was neutralized by adding chloride and sodium ions. The molecular
topologies and parameters from CHARMM36 were used.

Molecular dynamics simulations were performed with the program NAMD
(version 2.8). The system was first energy minimized for 1,000 steps with fixed
positions for the non-hydrogen atoms, and then for another 1,000 steps with 10 kcal mol⁻¹
restraints for the non-hydrogen atoms. Next, we performed a long equilibration
run of 50 ns in the canonical (NVT) ensemble (300 K, 96 Å × 96 Å × 96 Å volume),
with 10 kcal mol⁻¹ restraints for protein non-hydrogen atoms and 0.1 kcal mol⁻¹
restraints for water molecules, to optimize the locations of the lipid molecules around
the protein, especially around the hydrophilic groove. Finally, equilibration was
performed for 5 ns in the isothermal-isobaric (NPT) ensemble (300 K, 1.0 atm), with
10 and 0.1 kcal mol⁻¹ restraints for the protein main chain and side chain atoms,
respectively. The production process was performed for 1,000 ns. Constant
temperature was maintained by using Langevin dynamics. Constant pressure was
maintained by using the Langevin piston Nosé-Hoover method. Long-range electrostatic
interactions were calculated using the particle mesh Ewald method.
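
The per-residue root mean square fluctuation (r.m.s.f.) reported for this trajectory (Extended Data Fig. 6b) is a standard post-processing quantity. The following is a minimal sketch of how such a profile can be computed, not the authors' analysis script. It assumes that the frames have already been superimposed on a reference structure and that one representative atom per residue has been extracted into an array; the function name `rmsf` and the toy trajectory are hypothetical.

```python
import numpy as np


def rmsf(coords):
    """Return the r.m.s.f. of each atom.

    coords: array of shape (n_frames, n_atoms, 3) holding pre-aligned
    coordinates (alignment is assumed to have been done beforehand).
    """
    coords = np.asarray(coords, dtype=float)
    mean_pos = coords.mean(axis=0)                    # time-averaged position per atom
    sq_dev = ((coords - mean_pos) ** 2).sum(axis=2)   # squared displacement per frame and atom
    return np.sqrt(sq_dev.mean(axis=0))               # root of the time average


# Toy trajectory (illustrative only): atom 0 is static,
# atom 1 oscillates by +/- 1 Å along x around its mean position.
traj = np.zeros((4, 2, 3))
traj[:, 1, 0] = [1.0, -1.0, 1.0, -1.0]
values = rmsf(traj)
print(values)
```

Plotting `values` against residue number gives a profile of the kind shown in a fluctuation plot, with flexible loops standing out as peaks.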

Determination of the molecular mass of YidC in detergent solution by using
SEC-MALLS. The instrument set-up used for the SEC-MALLS experiment consisted
of a Prominence HPLC system (Shimadzu) with an SPD-20A ultraviolet absorbance
detector connected in series with a DAWN HELEOS II light-scattering detector
(Wyatt Technology) and an Optilab T-rEX refractive index detector (Wyatt
Technology). Analytical size exclusion chromatography was performed using a Superdex
200 10/300 column (GE Healthcare) equilibrated with buffer containing 300 mM
NaCl, 20 mM Tris-HCl, pH 8.0, 0.1% DDM and 0.01% CHS. Purified YidC27–266
(60 µg) was injected onto the column, and the elution was monitored in-line with
three detectors. A 658.0-nm wavelength laser was used in the light scattering
experiment. The data were corrected for the volume delay between the ultraviolet
detector and the other detectors and were analysed using ASTRA software (Wyatt Technology).
The molecular masses of the protein-micelle complex, the micelle and the protein
were determined as described previously. The refractive index increment (dn/dc),
in which n is the refractive index and c is the concentration, of the mixture of DDM
and CHS in buffer containing 300 mM NaCl and 20 mM Tris-HCl, pH 8.0, was
determined offline using an Optilab T-rEX refractive index detector with a 658.0-nm
wavelength laser, as described previously.
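
The offline dn/dc determination amounts to fitting a straight line to refractive index readings taken at several known concentrations; the slope is dn/dc. A minimal sketch of that fit follows. It is an illustration only: the function name `fit_dn_dc`, the concentration series and the slope of 0.14 ml g⁻¹ are hypothetical placeholders, not values from this study.

```python
import numpy as np


def fit_dn_dc(concentrations_g_per_ml, delta_ri):
    """Least-squares line through differential RI versus concentration.

    Returns (slope, intercept); the slope is dn/dc in ml/g when the
    concentrations are given in g/ml.
    """
    c = np.asarray(concentrations_g_per_ml, dtype=float)
    n = np.asarray(delta_ri, dtype=float)
    slope, intercept = np.polyfit(c, n, 1)  # degree-1 polynomial fit
    return slope, intercept


# Hypothetical five-step concentration series (placeholder numbers).
conc = [0.001, 0.002, 0.003, 0.004, 0.005]   # g/ml
assumed_slope = 0.14                         # ml/g, chosen only for the demo
ri = [assumed_slope * c for c in conc]       # noise-free synthetic readings

dn_dc, offset = fit_dn_dc(conc, ri)
print(f"dn/dc = {dn_dc:.3f} ml/g, intercept = {offset:.2e}")
```

With real detector readings the intercept should be close to zero after baseline subtraction, which is a quick check on the quality of the series.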
Extended Data Figure 1 | Multiple amino acid sequence alignment of YidC
proteins. Sequence alignment of Bacillus halodurans YidC2 (BhYidC2),
B. halodurans YidC1 (BhYidC1), Bacillus subtilis SpoIIIJ (BsSpoIIIJ), B. subtilis
YidC2 (BsYidC2) and Escherichia coli YidC (EcYidC). The secondary structure
of YidC27–266 is indicated above the sequences. The α-helices (as described in
the main text) and β-strands (ES1 and ES2 in the E2 region) are indicated by
cylinders and arrows, respectively. Strictly conserved residues among the five
molecules are highlighted in red boxes, and highly conserved residues are
indicated by red letters. The hydrophilic and bulky residues that were mutated
and the pBpa positions introduced into B. halodurans YidC2 are indicated
by grey, green and blue triangles, respectively. The spoIIIJ K248stop derivative
has a stop codon introduced at position 248, as indicated, and thereby expresses
a SpoIIIJ mutant that lacks the C-terminal 14 residues.
Extended Data Figure 2 | Electron density map of B. halodurans YidC.
Stereo view of the 2mFo − DFc electron density map of the TM2 helix,
contoured at 1.1σ, where m is the figure of merit and D is the SIGMA-A
weighting factor. Fo and Fc are the observed and the calculated structure factor
amplitudes, respectively.
Extended Data Figure 3 | Monomeric B. halodurans YidC. a, The crystal
packing of YidC27–266, viewed from the plane of the membrane. The molecule
in the asymmetric unit is coloured red. b, The crystal packing of YidC27–267,
viewed from the plane of the membrane. Two molecules (Mol A in light pink
and Mol B in light blue) are in the asymmetric unit. c, The chromatograms
show the ultraviolet (UV), refractive index (RI) and light scattering (LS)
detector readings. The volume delays between the UV detector and the other
detectors were corrected. The traces were normalized to the peak maxima. The
green and blue lines in the LS chromatogram indicate the calculated molecular
masses of the protein-detergent complex and the protein, respectively. d, The RI
of the mixture was measured in response to five concentration steps. The refractive
index increment (dn/dc) of the mixture of DDM and CHS was determined
using linear regression of the RI versus the concentration. e, The molecular
mass values determined by SEC-MALLS and calculated from the amino
acid sequence.
Extended Data Figure 4 | Structural flexibility of the hydrophilic groove
and C1 region. a, b, Superimposition of the crystal structures of YidC27–266
(coloured) and Mol B of YidC27–267 (grey), viewed from the plane of the
membrane (a) and from the extracellular space (b). The conformational
changes observed in CH1 and CH2 are indicated by black arrows. c, Close-up
view of the C1 region. The side chains of P111 are shown by stick models. In the
YidC27–267 structure, the arrangement of the C1 region with respect to the
core region is rotated by ~35° compared with that in the YidC27–266 structure.
As a result, the tip of the C1 region is displaced by ~10 Å in the YidC27–267
structure. d, Close-up views of the hydrophilic groove (left, YidC27–266; right,
Mol B of YidC27–267). The distances between the Cα atoms of C136 and M221
are indicated by dashed lines.
Extended Data Figure 5 | The hydrophobic core of B. halodurans YidC.
a, The crystallographic B-factors are coloured in a gradient varying from blue to
red, representing 30 to 140 Å². b, c, Stereo views of the hydrophobic core,
showing the side chains of the hydrophobic residues.
Extended Data Figure 6 | Molecular dynamics simulation of B. halodurans
YidC for 1,000 ns in a lipid bilayer. a, Snapshots of the structure over the time
course of the simulation at 400 ns intervals: 0 ns (blue), 400 ns (magenta) and
800 ns (light green). b, Root mean square fluctuation (r.m.s.f.) of YidC during
the simulation. The secondary structure of YidC is indicated below the line.
c, The water probability density map in the simulation, contoured at 0.001
molecules Å⁻³ ns⁻¹.
Extended Data Figure 7 | Gene structures of the strains and YidC mutants
used for in vivo genetic analyses. a, Schematic representations of the gene
structures of the yidC2-lacZ reporter strains used for the MifM-based assay.
spoIIIJ*-flag indicates either wild type or mutant spoIIIJ. yidC2-lacZ represents
a translational gene fusion with the lacZ sequence in-frame after the sixth
codon of yidC2. The native mifM-yidC2 on the chromosome remained intact.
b, Deleted regions of SpoIIIJ, viewed from the extracellular side. Residue
numbers in SpoIIIJ are indicated. Δ92–126-GG represents a mutant in
which the entire C1 region has been replaced by a glycine-glycine linker.
Δ97–103/Δ114–120 represents a mutant in which both the CH1 and CH2
helices have been shortened by seven residues. c, Schematic representations of
the gene structures used for growth complementation assays. SpoIIIJ becomes
essential for the growth of B. subtilis when yidC2 is disrupted. Cells with a
disruption of chromosomal yidC2 were transformed with the rescue plasmid
pCH1805, which expresses wild-type spoIIIJ-flag under the control of the
IPTG-inducible Pspac promoter. The native spoIIIJ on the chromosome was
replaced by either wild-type or mutant spoIIIJ (spoIIIJ*-flag). In the absence of
IPTG, spoIIIJ-flag is not expressed from the plasmid, making the chromosomal
spoIIIJ*-flag the only source of cellular YidC. The complementation test
measures the global role of SpoIIIJ in inserting a wide range of membrane
proteins, including single-spanning and multi-spanning membrane proteins.
Extended Data Figure 8 | Schematic explanation of β-galactosidase activity
assay and MifM insertion activity. a, b, MifM is a single-spanning membrane
protein, and its membrane insertion is considered to be mediated by YidC
(SpoIIIJ). To evaluate the MifM insertion activity of SpoIIIJ, we performed a
genetic analysis using B. subtilis. In B. subtilis, SpoIIIJ is constitutively
expressed, whereas YidC2 is expressed only when the SpoIIIJ activity is
compromised, by the following mechanism. The expression of yidC2 is
regulated by the upstream cis regulator open reading frame of mifM, which
is co-transcribed with yidC2. During the synthesis of MifM, the C-terminal
region of nascent MifM interacts with the peptide exit tunnel of the ribosome
and causes translational arrest. When the SpoIIIJ activity is normal, the
translational arrest is released by the SpoIIIJ-dependent membrane insertion of
MifM. Therefore, the translational arrest is transient or does not occur (a).
By contrast, when SpoIIIJ activity is compromised, MifM is not inserted into
the membrane, and its translation is arrested, which causes ribosome stalling.
The stalled ribosome disrupts the downstream stem-loop structure and
exposes the Shine-Dalgarno (SD) translation initiation signal sequence of the
yidC2 messenger RNA (b). Thus, we can estimate the in vivo SpoIIIJ activity
by measuring the expression of the introduced yidC2-lacZ fusion (Extended
Data Fig. 7a): the reduction of MifM insertion efficiency by SpoIIIJ elevates the
LacZ activity.
Extended Data Figure 9 | Effects of N-terminal negatively charged residues
of substrates on insertion. a, Schematic representations of the N-terminal
negatively charged residues of MifM and the Pf3-MifM chimaeric protein.
b, Membrane insertion efficiencies of MifM mutants and Pf3-MifM mutants.
The efficiencies were determined by the LacZ activities (mean ± s.d., n = 3).
The N-terminal negatively charged residues of MifM and Pf3-MifM and the
numbers of the charged residues are shown at the bottom (EDD, wild-type
MifM; DD, wild-type Pf3-MifM). Mutations of the acidic residues in the Pf3
coat protein had less pronounced effects than mutation of those in MifM,
probably because the membrane insertion is facilitated by multiple interacting
factors depending on the amino acid sequence.
Extended Data Table 1 | Data collection and refinement statistics

                      YidC27–266           YidC27–267           YidC27–266 (Y150C) (methyl HgCl)
                                                                Peak                 Inflection
Data collection
Space group           P21                  P21                  P21
Cell dimensions
  a, b, c (Å)         43.9, 60.6, 58.9     54.0, 70.1, 83.1     43.8, 59.7, 58.6
  α, β, γ (°)         90, 100.3, 90        90, 92.0, 90         90, 100.3, 90
Wavelength (Å)        1.000                1.378                1.00000              1.00945
Resolution (Å)        50–2.40 (2.44–2.40)  45–3.20 (3.40–3.20)  50–3.00 (3.05–3.00)  50–3.00 (3.05–3.00)
Rsym                  0.068 (0.318)        0.110 (0.472)        0.089 (0.350)        0.101 (0.456)
I/σI                  22.5 (2.85)          10.1 (2.96)          29.3 (4.72)          21.5 (2.75)
Completeness (%)      91.0 (85.6)          95.6 (99.4)          99.3 (98.3)          99.2 (98.3)
Redundancy            2.6 (1.9)            3.9 (3.8)            5.8 (4.4)            5.4 (3.5)

Refinement
Resolution (Å)        2.40                 3.20
No. reflections       10,879               19,127
Rwork/Rfree (%)       24.2/25.9            26.0/29.1
No. atoms
  Protein             1,601                3,586
  Ligand/ion          60                   4
  Water               8
B-factors
  Protein             65.24                96.2
  Ligand/ion          77.73                121.0
  Water               49.43
R.m.s. deviations
  Bond lengths (Å)    0.002                0.002
  Bond angles (°)     0.714                0.765
Ramachandran plot
  Favoured (%)        99.5                 96.1
  Allowed (%)         0.5                  3.6
  Outliers (%)        0.0                  0.2

*The numbers in parentheses are for the highest resolution shell.
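
For readers less familiar with the residuals listed in this table, the conventional crystallographic definitions are sketched below. These are the standard textbook forms rather than expressions quoted from the paper; the index i over repeated measurements of a reflection and the set labels "work" and "free" are notation introduced here for illustration. Fo and Fc are the observed and calculated structure factor amplitudes, as in Extended Data Fig. 2.

```latex
R_{\mathrm{sym}} = \frac{\sum_{hkl}\sum_{i}\bigl|I_i(hkl) - \langle I(hkl)\rangle\bigr|}
                        {\sum_{hkl}\sum_{i} I_i(hkl)}
\qquad
R_{\mathrm{work}} = \frac{\sum_{hkl \in \mathrm{work}}\bigl|\,|F_o(hkl)| - |F_c(hkl)|\,\bigr|}
                         {\sum_{hkl \in \mathrm{work}} |F_o(hkl)|}
\qquad
R_{\mathrm{free}} = \frac{\sum_{hkl \in \mathrm{free}}\bigl|\,|F_o(hkl)| - |F_c(hkl)|\,\bigr|}
                         {\sum_{hkl \in \mathrm{free}} |F_o(hkl)|}
```

Rfree is evaluated on a small set of reflections withheld from refinement, so a modest gap between Rwork and Rfree, as in the values above, is the usual sign that the model has not been overfitted.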
Three-dimensional printing is opening up applications in many areas, including research.


TECHNOLOGY 


Building opportunities

More jobs and research positions are being created as interest in 3D printing grows.


BY NEIL SAVAGE 


Kiril Vidimče began his professional life as a software graphics engineer in
California; he spent eight years at Pixar Animation Studios in Emeryville and
another four at chip-maker Intel in Santa Clara. But he wanted to move beyond
the world of animated movies and video games into something that existed
outside of a video screen. So in February 2012, he joined the Computational
Fabrication Group at Massachusetts Institute of Technology in Cambridge. He's
now pursuing a PhD in computational fabrication and writing software that
tells three-dimensional (3D) printers how to build complicated objects.
“I’m excited about working on physical things for a change. I’ve never been a
maker myself, but I always wanted to be. I wanted to play with more physical
stuff,” Vidimče says. “I think it’s fun to get to the intersection where
you're combining knowledge of computer science and electronics and materials.”
Indeed, 3D printing, also known as
additive manufacturing, is a growing field
that draws researchers from a variety of dis- 
ciplines — computing, materials science, 
mechanical and electrical engineering, phys- 
ics, chemistry, and even biology. They are in 
demand to develop the technology into some- 
thing that might change how manufacturing is 
done and create all sorts of products that have 
not been possible before, from lighter-weight 
aircraft parts to cubes that unfold into furni- 
ture to ‘biobots’ made with living cells. 

Wohlers Associates, a consulting firm in Fort 
Collins, Colorado, that focuses on the industry, 
says that the worldwide market for 3D printing 
products and services jumped to US$3.1 billion 
in 2013 (a rise of 35% from 2012), and expects 
it to grow to $10.8 billion by 2021. That will
certainly open up more jobs in the field, says 
company president Terry Wohlers. “As the 
industry grows, so will the need for designers, 
scientists and researchers — especially chem- 
ists — to develop new materials,” he says. “Also, 
we'll need mechanical and electrical engi- 
neers to develop new-generation 3D printers, 
manufacturing engineers to develop processes 
and supply chains and educators to introduce 
all of this to a new generation of designers, 
technicians and others.”

Some of those jobs will be in the 30 or so com- 
panies that manufacture the printing systems 
in Europe, the United States, China and Japan. 
“The prospects of employment are very good,” 
says Cathy Lewis, chief marketing officer at one 
of those companies, 3D Systems in Rock Hill, 
South Carolina, which lists numerous open- 
ings for engineers of different stripes. She says 
it's hard to project exact numbers of future hires, 
but points out that the company has grown from 
300 employees four years ago to 1,500 today, 
many of them in technical positions. 

Governments around the world are pour- 
ing money into research projects intended to 
advance the technology of 3D printing from 
a niche field to a major part of manufacturing, 
providing funds to encourage partnerships 
between universities and industry. In 2012, 
for instance, the US government launched the 
National Additive Manufacturing Innovation 
Institute, since dubbed America Makes, based 
in Youngstown, Ohio, with $30 million in gov- 
ernment funding and another $40 million from 
industry. The institute has given out $13.5 mil- 
lion in matching funds to research teams from 
both industry and academia. The US National 
Science Foundation, the Defense Advanced 
Research Projects Agency and NASA are all 
also funding 3D printing research. 
In 2013, the Singapore government 
announced that it would invest S$500 million 
(US$400 million) over five years in research 
into 3D printing and other innovative tech- 
nologies such as robotics, and the Chinese 
government intends to invest $240 million 
in 3D printing over three years. Last summer, 
the UK’s Technology Strategy Board and the 
Research Councils announced £8.4 million 
(US$14.22 million) in such funding. It is hard 
to say exactly how many jobs all that translates 
into, but David Bourell, a mechanical engineer 
and materials scientist who runs the Labora- 
tory for Freeform Fabrication at the University 
of Texas, Austin, says he hears of universities 
hiring more faculty who focus on 3D printing. 
“There is an upswing in the past few years as 
public and governmental interest in the field 
has increased,” he says. “That’s the topic they 
want to hire in.” In part, he says, the surge in 
interest is being driven by the expiration of the 
founding patents for 3D-printing technologies, 
meaning that it is now less expensive for 
companies that want to get involved in the field. 


VALUE ADDED 

For his part, Vidimče took an idea from 
computer-graphics rendering and used it 
to create OpenFab, computer-aided design 
software for building an object from more than 
one material. A computer model describing 
the 3D shape of an object, as well as the specific 
areas where different materials go, would be so 
large that it could halt the printing process. So 
instead of trying to model the whole object at 
once, Vidimče’s programme breaks it up into 
small sections, and provides the printer with 
a recipe for what to do in each section. That is 
the same way that graphics software renders 
complex images in a video game, treating 
them as a series of small problems instead of 
one large one. 
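
The divide-and-stream idea described above can be sketched in a few lines. This is a minimal illustration and not OpenFab's actual interface: the names `tiles`, `stream_recipes` and `two_material_block` are hypothetical, and the object is assumed to be a simple voxel grid whose material at each point is given by a function.

```python
from itertools import product
from typing import Callable, Iterator, List, Tuple

Voxel = Tuple[int, int, int]


def tiles(size: Voxel, tile: int) -> Iterator[Tuple[Voxel, Voxel]]:
    """Yield (start, stop) corners of cubic tiles that together cover the grid."""
    ranges = [range(0, n, tile) for n in size]
    for x, y, z in product(*ranges):
        stop = (min(x + tile, size[0]),
                min(y + tile, size[1]),
                min(z + tile, size[2]))
        yield (x, y, z), stop


def stream_recipes(
    size: Voxel,
    tile: int,
    material_at: Callable[[int, int, int], str],
) -> Iterator[Tuple[Voxel, List[Tuple[Voxel, str]]]]:
    """Yield one small 'recipe' per tile instead of building the whole model."""
    for start, stop in tiles(size, tile):
        recipe = [
            ((x, y, z), material_at(x, y, z))
            for x in range(start[0], stop[0])
            for y in range(start[1], stop[1])
            for z in range(start[2], stop[2])
        ]
        # Only this tile's voxels are in memory when the recipe is handed on.
        yield start, recipe


def two_material_block(x: int, y: int, z: int) -> str:
    """Hypothetical material rule: two rigid bottom layers, soft material above."""
    return "rigid" if z < 2 else "soft"


if __name__ == "__main__":
    voxels_sent = 0
    for start, recipe in stream_recipes((8, 8, 4), 4, two_material_block):
        voxels_sent += len(recipe)  # a real pipeline would send this to the printer
    print(voxels_sent)
```

Because `stream_recipes` is a generator, each section is produced and discarded in turn, which is the point of the approach: the full multi-material description never has to exist in memory at once.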

At 3D Systems, Mehdi Mojdeh, director of 
3D printing and a technical fellow, says that 
much of the company’s research is aimed at 
improving the speed and accuracy of the 
process. Although industry has been using the 
technology for a while to design and prototype 
new products, “we are trying to push 3D printing 
more into a manufacturing and production 
environment,” he says. 


Jewellery makers benefit from 3D printing. 


522 | NATURE | VOL 509 | 22 MAY 2014 


CASE STUDY 


Learning about 3D printing 


• 3D Systems in Rock Hill, South Carolina, 
  offers internships at various locations, 
  including New York, Los Angeles and 
  Boston, for people thinking about pursuing 
  research in 3D printing to get some 
  first-hand experience with the machines. 
  http://www.3dsystems.com 
• Many universities have 3D printers in one 
  research lab or other. And hackerspaces, 
  which provide the public with various types 
  of equipment to build things, usually have 
  one, and will often offer training in how to 
  use it. 
  http://hackerspaces.org/wiki 
• The website ‘Rapid Today’ provides a list of 
  degree programmes that include 3D printing. 
  http://www.rapidtoday.com/get_prototyping_education.html 
• The online Rapid Prototyping Journal gives 
  an entry into the literature on current and 
  past research in 3D printing. 
  http://www.emeraldinsight.com 
• Students could consider attending 
  conferences to hear reports of current 
  research and to talk to people in the field. 
  • The Additive Manufacturing and 3D 
    Printing International Conference in 
    Nottingham, UK, on 7–11 July. 
    http://www.am-conference.com/ 
  • The 25th Annual International Solid 
    Freeform Fabrication Symposium in Austin, 
    Texas, on 4–6 August. 
    http://sffsymposium.engr.utexas.edu/ 
  • The Inside 3D Printing Conference 
    and Expo is a business-to-business trade 
    show with several shows a year around 
    the globe. 
    http://www.mediabistro.com/inside3dprinting/ 
  • The Rapid Prototyping Calendar lists a 
    variety of upcoming conferences. 
    http://www.additive3d.com/conf.htm 
N.S. 


General Electric (GE) in Fairfield, Connecticut, 
is increasingly using the technology for 
building parts that go into its products, from 
refrigerators to jet engines. The company has 
more than 500 employees working directly 
on 3D printing, and Stephan Biller, GE’s chief 
manufacturing scientist, says that this is likely 
to increase. “There’s no doubt about it, additive 
manufacturing is going to change GE and 
it's going to change manufacturing as a whole,” 
Biller says. “I don't think people will have 
trouble finding a job in this area.” 

One of the greatest needs, Mojdeh says, is 
for researchers who focus on the properties of 
materials. Companies would like to expand the 
range of material properties, such as strength, 
hardness, conductivity and so on, that are 
available. The wider the range of properties, the 
more types of objects that can be built, and the 
more functions such objects can have. Another 
key need is for scientists who work on an 
ultimate goal of the industry: multi-material 
printing, or the building of devices made of 
different substances (say, a ceramic engine part 
with metal circuitry built in). Very few products, 
after all, consist of only a single material, but 
today’s 3D machines can print with only two 
different polymers at most. 

Those goals mean that the industry needs 
both researchers who can develop materials 
with desirable characteristics, and those who 
can make the various printing technologies 
work with those new materials, as well as 
with each other. It is not yet possible to print 
metal and plastic in the same machine, for 
example, because the heat needed to fuse 
the metal would vaporize the plastic. To find 
solutions, researchers from different science 
and engineering disciplines will have to 
work together, because an advance in one area 
depends on, and affects, others. “If you 
continually develop the materials, then 
processes will have to change, and design systems 
will have to evolve, so it’s very much a moving 
research target,” says Christopher Tuck, deputy 
director of the UK Engineering and Physical 
Sciences Research Council's Centre for 
Innovative Manufacturing in Additive Manufacturing 
at the University of Nottingham, UK. 

In February, the centre launched a new post- 
graduate degree programme, part of a grow- 
ing movement to train researchers to aim at 
those targets (see ‘Case study’). It also aims to 
prepare people from disparate backgrounds to 
apply their expertise to 3D problems — say, a 
biologist wanting to print artificial organs with 
living cells, or a polymer chemist creating a 
more durable plastic part. The centre is look- 
ing for 66 students for a PhD programme that 
begins in October. “We're going to be recruit- 
ing scientists and engineers from much more 
outside mechanical engineering. It’s going 
to be a lot of physics, chemistry, maybe a 
biologist,” says Tuck. 

All the students will spend their first year 
at Nottingham, getting a grounding in how 
3D printing works, but they'll then special- 
ize at one of four institutions, depending on 
their area of interest. Nottingham will focus 
on design, multi-material printing and 
polymers. Loughborough University will 
deal with hybrid printing technologies. 
The University of Liverpool will focus on 
research into metals, and Newcastle Univer- 
sity will specialize in additive manufacturing 
for biomedical uses. 


GOING GLOBAL 
Last September, Nanyang Technological 
University in Singapore opened the NTU 
Additive Manufacturing Centre. The cen- 
tre offers master’s degrees in precision 
engineering, mechanical engineering and 
manufacturing systems and engineering, 
all with a specialization in additive manu- 
facturing, as well as a PhD in mechanical 
engineering with a topic in additive manu- 
facturing. It is also providing internships to 
students from around the world. 

The creation of the centre has led to postdoc 
and faculty positions in several disciplines, 
including materials science, mechanical 
engineering, chemical engineering and 
biomedical engineering, says Chua Chee Kai, 
who heads it. He says it is too early to predict 
how many jobs related to 3D printing there 
might be in the business world in general, 
because the technology is in the early stages 
of adoption. “We do, however, firmly believe 
that these are high-value jobs that will change 
the landscape of that particular business or 
sector,” he says. And as the field grows, he 
predicts, “we will need not just engineers, but 
also computer experts, mathematicians, 
software engineers, material scientists, quality 
experts, system-solution providers and many 
more to sustain the economic ecosystem of 
3D printing.” 

Bourell says that even institutions without 
formal programmes usually have some 
research going on. “You can pick almost 
any university and they have some kind of 
activity these days,” he says. “They have at 
least one person dabbling.” He expects the 
field to evolve and expand rapidly enough 
over the next 5 to 10 years for there to be 
considerable demand for people with 
master’s- and PhD-level education. “We're right 
on the edge of the wave,” Bourell says. “Now 
it’s time to jump in.” ■ 


Neil Savage is a freelance writer in Lowell, 
Massachusetts. 


RESEARCH EFFICIENCY 


Time wasted 


US scientists are spending 42% of their 
time on bureaucratic chores, according 
to a US National Science Board (NSB) 
report. The NSB named financial and 
personnel reporting and over-detailed 
preliminary grant proposals as common 
complaints. Arthur Bienenstock, chair 
of the NSB task force that authored the 
report, says that the need to submit 
a detailed budget in the first round 
of a grant proposal, for example, is 
‘nonsensical’. The NSB found that an 
over-regulation culture is overwhelming 
US scientists, and recommends limiting 
proposal requirements to those essential 
for evaluating merit; keeping reporting 
focused on outcomes; and automating 
payroll certification. 


AWARDS 
Postdoc prizes 


Three science-award recipients 
based in New York City have pledged 
US$3 million of their prize money to set 
up an endowment to fund awards for 
postdoctoral researchers. Charles Sawyers 
of Memorial Sloan Kettering Cancer 
Center, Cori Bargmann of Rockefeller 
University and Lewis Cantley of Weill 
Cornell Medical College last year each 
won $3-million ‘Breakthrough prizes’. 
The new Tri-Institutional Breakout 
Awards for Junior Investigators will offer 
three to six $25,000 awards per year. 

The endowment founders say they are 
donating part of their prizes because they 
believe that postdocs will benefit more 
from the support. “For me, a prize at that 
stage of my career would have been really 
helpful,” says Bargmann, who adds that 
postdocs do much of the most creative 
work in science. Inaugural winners will be 
announced by the end of the year. 


MULTIPLE DISCIPLINES 
Converging futures 


Exciting future research opportunities 
will arise where disciplines converge — 
but government funding and academic 
structures are ill prepared to take 
advantage of them, says a report from the 
US National Research Council. Report 
committee chairman Joseph DeSimone 
says that this convergence will require 
scientists to understand the language of 
multiple disciplines and sectors, and calls 
3D printing (see page 521) a ‘wonderful 
example’, as it depends on so many 
different disciplines. 


SCIENCE FICTION 


TRIALS 


An experimental direction. 


BY JOHN FRIZELL 


“You can't do experiments on my 
boyfriend,” said Ellie. “I absolutely 
forbid it. It must be illegal, or 
unethical, or something.” 

“He volunteered,” said Jamie. “He gets paid 
and all legal requirements are being met.” 

“Does Mum know how much you're 
spending?” 

Although her little brother was a 
millionaire thanks to his inventions and 
patents, their mother controlled the 
money until he was 18. 

“She won't mind.” 

“I love Claude. You better not hurt him.” 

“You can always get another one. You 
change them all the time.” 

“So what is this about?” she said, as an 
alternative to slapping him. 

“Invisible earphones.” 

“Well that’s good.” 

She liked the idea. No set of earphones 
ever made looked remotely cool. She 
loved listening to music but didn’t want 
to go around with earphones on all day 
looking like a loser. 

“How will you do it?” 

Jamie's gifts did not include the ability to 
explain his ideas to normal people. There 
was something about magnetosomes, tiny 
things with iron crystals in them. 

“They are the compass of homing 
pigeons,” said Jamie, and then immediately 
contradicted himself by saying that 
what birds used was smaller and more 
subtle. “It may not work, but the point is 
that if these things can align themselves 
with a magnetic field then we can use a 
magnetic field to move them and induce 
signals directly into nerves.” 

“The point is that you want to inject tiny 
particles into Claude's brain. Well you can't.” 

But he could. Ellie's appeal to their mother 
was in vain. 

“There's nothing to worry about, dear,” she 
said. “Claude is just one of nine students from 
your university who signed up for the 
experiment. It’s being run by a licensed human 
testing organization and there’s a full review and 
risk assessment. Jamie showed me.” 

Ellie nodded and smiled. She always 
smiled at her mother when she was trying 
to change her mind. Ellie had googled the 
firm and the individual doctors but found 
nothing she could use to stop the trials 
from going ahead. 

“But it’s a lot of money. Even Jamie says he 
doesn’t know if it will work.” 

“Oh don’t you worry about that. The 
accountants will offset it against tax. It 
won't cost anything to try and it might be 
a big success.” 

There was no change in Claude. He had always 
been given to acts of spontaneous generosity, 
and now that he had more 
money Ellie got more lovely surprises. 
But her brother was glum. The hidden 
collar magnets produced no sounds. Big- 
ger magnets did nothing. The volunteers 
had even put their heads into some sort of 
super magnet. None of them heard induced 
sounds. The trials had failed. She texted 
Jamie an invitation for coffee so she could 
cheer him up. 

She had Claude on her arm when they 
met. Jamie gave him the polite, unengaged 
smile that he gave to most people. 

“Shall we go to Coffee Delight?” she said. 

“Let’s go to Midnight Mocha.” 

It was typical Jamie. CD was just down the 
block and it was bright, bustling and super 
popular; everyone was there. Midnight Mocha 
was gloomy and usually empty. But it was what 
Jamie liked and she was doing this for him. She 
started down the well-worn route but Claude's 
strong warm arm steered her away from it. 

“Let’s go this way,” he said. 

They took a funny route through the crisp 
autumn air, cutting through a park, walk- 
ing on the grass instead of following the 
paths and then took a sharp turn at the 
end, not the shortest way but nice. Jamie 
paced beside them, messing with his 
mobile phone. 

Ellie tried to cheer Jamie up over 
cappuccino, but he was even more intro- 
verted than usual. After one of Claude's 
jokes Jamie looked up. 

“When we walked here you were always 
heading 17° west of south,” said Jamie. 
“Why did you do that?” 

“Just walking,” said Claude. “At least we 
got here.” 

He took the opportunity to launch into 
a hilarious story that left Ellie doubled up 
with laughter and even brought a smile 
to Jamie’s face. When they left Jamie paid 
for everything: coffees, cakes and refills. 

“I can charge it to the project,” he said. 

It was a first. She must have succeeded 
in cheering him up; her tight-fisted 
brother never paid for anyone else. 

Two mornings later, tears were stream- 
ing down Ellie's face as she tried to eat her 
low-fat yogurt breakfast. She wiped them 
away as Jamie came into the kitchen. 

“Is Claude leaving you?” 

She sat up, astonished. Jamie could 
hardly tell if other people liked or disliked 
him. He had never had any sort of 
relationship in his life and was hopeless at 
picking up the most basic social cues. 

“How did you know?” 

“The way he walked to Midnight Mocha. 
If you apply the magnetic declination to that 
heading, he was walking straight away from 
magnetic north.” 

“But Jamie! He's quitting university. He's 
going to Florida. I’ll never see him again!” 

“Do you love him enough to wait for him?” 

“Oh, Jamie. You know I do.” 

“Then it will be OK. He’s just migrating. 
Don’t worry, he’ll be back in the spring.” ■ 


John Frizell was trained in biochemistry 
and works in ocean conservation for 
Greenpeace. In his spare time he walks, 
builds robots and sings. 